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CHEMICAL INHIBITORS OF MISMATCH REPAIR 

TECHNICAL FIELD OF THE INVENTION 

The invention is related to the area of mutagenesis. In particular it is related to the 
field of blocking specific DNA repair processes. 

BACKGROUND OF THE INVENTION 

Mismatch repair (MMR) is a conserved DNA repair process that is involved in 
post-replicative repair of mutated DNA sequences that occurs after genome replication. 
The process involves a group of gene products, including the mutS homologs GTBP, 
HMSH2. and HMSH3 and the mutL homologs hMLHl, hPMSl, and hPMS2 (Bronner, C.E. 
et al (1994) Nature 368:258-261; Papadopoulos, N. et al (1994) Science 263:1625-1629; 
Leach, F.S. et al. (1993) Cell 75:1215-1225; Nicolaides, N.C. et al. (1994) Nature 
371 :75-80) that work in concert to correct mispaired mono-, di-, and tri-nucleotides, point 
15 mutations, and to monitor for correct homologous recombination. Germline mutations in 
any of the genes involved in this process results in global point mutations, and instability 
of mono, di and tri-nucleotide repeats (a feature referred to as microsatellite instability 
(MI)), throughout the genome of the host cell. In man, genetic defects in MMR results in 
the predisposition to hereditary nonpolyposis colon cancer, a disease in which tumors 
retain a diploid genome but have widespread MI (Bronner, C.E. et al (1994) Nature 
368:258-261; Papadopoulos, N. et al (1994) Science 263:1625-1629; Leach, F.S. et al 
(1993) Cell 75:1215-1225; Nicolaides, N.C. et al. (1994) Nature 371:75-80; Harfe B.D., 
and S. Jinks-Robertson (2000) An. Re\>. Genet. 34:359-399; Modrich, P. (1994) Science 
266:1959-1960). Though the mutator defect that arises from MMR deficiency can affect 
any DNA sequence, microsatellite sequences are particularly sensitive to MMR 
abnormalities (Peinado, M.A. et a/.(1992) Proc. Natl Acad. Sci. USA 89:10065-10069). 
Microsatellite instability is therefore a useful indicator of defective MMR. In addition to 
its occurrence in virtually all tumors arising in HNPCC patients, MI is found in a small 
fraction of sporadic tumors with distinctive molecular and phenotypic properties that is 
due to defective MMR (Perucho, M. (1996)^. Chem. 377:675-684). 

MMR deficiency leads to a wide spectrum of mutations (point mutations, 
insertions, deletions, recombination, etc.) that can occur throughout the genome of a host 
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cell. This effect has been found to occur across a diverse array of organisms ranging from 
but not limited to unicellular microbes, such as bacteria and yeast, to more complex 
organisms such as Drosophila and mammals, including mice and humans (Harfe B.D., and 
S. Jinks-Robertson (2000) An. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 
5 266:1959-1960). The ability to block MMR in a normal host cell or organism can result in 
the generation of genetically altered offspring or sibling cells that have desirable output 
traits for applications such as but not limited to agriculture, pharmaceutical, chemical 
manufacturing and specialty goods. A chemical method that can block the MMR process 
is beneficial for generating genetically altered hosts with commercially valuable output 

10 traits. A chemical strategy for blocking MMR in vivo offers a great advantage over a 

recombinant approach for producing genetically altered host organisms. One advantage is 
that a chemical approach bypasses the need for introducing foreign DNA into a host, 
resulting in a rapid approach for inactivating MMR and generating genetically diverse 
offspring or sib cells. Moreover, a chemical process is highly regulated in that once a host 

15 organism with a desired output trait is generated, the chemical is removed from the host 
and its MMR process would be restored, thus fixing the genetic alteration in subsequent 
generations. The invention described herein is.directed to the discovery of small 
molecules that are capable of blocking MMR, thus resulting in host organisms with MI, a 
hallmark of MMR deficiency (Peinado, M.A. et al (1992) Proc. Natl Acad. Set USA 

20 89:10065-10069; Perucho, M. (1996) Biol Chem. 377:675-684; Wheeler, J.M. et al. 

(2000) J. Med. Genet 37:5S8-592; Hoang, J.M. et al (1997) Cancer Res. 57:300-303). 
Moreover, host organisms exhibiting MI are then selected for to identify subtypes with 
new output traits, such as but not limited to mutant nucleic acid molecules, polypeptides, 
biochemicals, physical appearance at the microscopic and/or macroscopic level, or 

25 phenotypic alterations in a whole organism. In addition, the ability to develop MMR 
defective host cells by a chemical agent provides a valuable method for creating 
genetically altered cell hosts for product development. The invention described herein is 
directed to the creation of genetically altered cell hosts via the blockade of MMR using 
chemical agents in vivo. 

30 The advantages of the present invention are further described in the examples and 

figures described within this document. 
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SUMMARY OF THE INVENTION 

The invention provides methods for rendering cells hypermutable by blocking 
MMR activity with chemical agents. 

The invention also provides genetically altered cell lines which have mutations 
introduced through interruption of mismatch repair. 

The invention further provides methods to produce an enhanced rate of genetic 
hypermutation in a cell. 

The invention encompasses methods of mutating a gene of interest in a cell, 
methods of creating cells with new phenotypes, and methods of creating cells with new 
phenotypes and a stable genome. 

The invention also provides methods of creating genetically altered whole 
organisms and methods of creating whole organisms with new phenotypes. 

These and other objects of the invention are provided by one or more of the 
15 embodiments described below. 

In one embodiment of the invention, a method for screening chemical compounds 
that block mismatch repair (MMR) is provided. An MMR-sensitive reporter gene 
containing an out-of-frame polynucleotide repeat in its coding region is introduced into an 
MMRproficientcell. The cell is grown in the presence of chemicals. Chemicals that alter 
the genetic structure of the polynucleotide repeat yield a biologically active reporter *ene 
product. Chemicals that disrupt the polynucleotide repeat are identified as MMR blowing 
agents. 

In another embodiment of the invention, an isolated MMR blocking chemical is 
provided. The chemical can block MMR of a host cell, yielding a cell that exhibits an 
25 enhanced rate of hypermutation. 

In another embodiment of the invention, a method is provided for introducing a 
mutahon into a gene of interest. A chemical that blocks mismatch repair is added to the 
culture of a cell line. The cells become hypermutable as a result of the introduction of the 
chenucal. The cell further comprises a gene of interest. The cell is cultured and tested to 
30 determine whether the gene of interest harbors a mutation. 



20 



In another embodiment of the invention, a method is provided for producing 
Phenotypes of a cell. A chemical that blocks mismatch repair is added to a cell culture 
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The cell becomes hypermutable as a result of the introduction of the chemical. The cell is 
cultured and tested for the expression of new phenotypes. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a cell ir which mismatch repair is blocked via a chemical agent. The chemical 
5 is removed from the cell culture and the cell restores its genetic stability. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a cell with blocked mismatch repair and a newly selected phenotype. The 
chemical agent is removed from the cell culture and the cell restores its genetic stability 
and the new phenotype is stable. 
10 In another embodiment of the invention, a chemical method for blocking MMR in 

plants is provided. The plant is grown in the presence of a chemical agent. The plant is 
grown and exhibits an enhanced rate of hypermutation. 

In another embodiment of the invention, a method for screening chemical 
inhibitors of MMR in plants in vivo is provided. MMR-sensitive plant expression vectors 
15 are engineered. The reporter vectors are introduced into plant hosts. The plant is grown in 
the presence of a chemical agent. The plant is monitored for altered reporter gene 
function. 

In another embodiment of the invention, a method is provided for introducing a 
mutation into a gene of interest in a plant. A chemical that blocks mismatch repair is 
20 added to a plant. The plant becomes hypermutable as a result of the introduction of the 
chemical. The plant further comprises a gene of interest. The plant is grown. The plant is 
tested to determine whether the gene of interest harbors a mutation. 

In another embodiment of the invention, a method is provided for producing new 
phenotypes of a plant. A chemical that blocks mismatch repair is added to a plant. The 
25 plant becomes hypermutable as a result of the introduction of the chemical. The plant is 
grown and tested for the expression of new phenotypes. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a plant in which mismatch repair is blocked via a chemical agent. The 
chemical is removed from the plant culture and the plant restores its genetic stability. 
30 In another embodiment of the invention, a method is provided for restoring genetic 

stability in a plant with blocked mismatch repair and a newly selected phenotype. The 
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chemical agent is removed from the plant culture and the plant restores its genetic stability 
and the new phenotype is stable. 

These and other embodiments of the invention provide the art with methods that 
can generate enhanced mutability in microbes, organisms of the protista class, insect cells, 
mammalian cells, plants, and animals as well as providing cells, plants and animals 
harboring potentially useful mutations. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows diagrams of mismatch repair (MMR) sensitive reporter genes. 
Figure 2 shows a screening method for identifying MMR blocking chemicals. 
Figure 3 shows identification of a small chemical that blocks MMR and genetically alters 
the pCAR-OF vector in vivo. 

Figure 4 shows shifting of endogenous microsatellites in human cells induced by a 
chemical inhibitor of MMR. 

15 Figure 5 shows sequence analysis of microsatellites from cells treated with chemical 
inhibitors of MMR with altered repeats. 

Figure 6 shows generation of host organisms with new phenotypes using a chemical 
blocker of MMR. 

Figure 7 shows a schematic diagram of MMR-sensitive reporter gene for plants. 
20 Figure 8 shows derivatives of lead compounds and thereof that are inhibitors of MMR in 
vivo. 

DETAILED DESCRIPTION OF THE INVENTION 

Various definitions are provided herein. Most words and terms have the meaning 
25 that would be attributed to those words by one skilled in the art. Words or terms 
specifically defined herein have the meaning provided in the context of the present 
invention as a whole and as are typically understood by those skilled in the art. Any 
conflict between an art-understood definition of a word or term and a definition of the 
word or term as specifically taught herein shall be resolved in favor of the latter. Headings 
30 used herein are for convenience and are not to be construed as limiting. 

As used herein the term "anthracene" refers to the compound anthracene. However, 
when referred to in the general sense, such as "anthracenes," "an anthracene" or "the' 
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anthracene such terms denote any compound that contains the fused triphenyl core structure 
of anthracene, i.e., 




5 In certain preferred embodiments of the invention, the anthracene has the formula: 

wherein RrR 10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
10 arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, caiboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

15 wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 

heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
20 aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of R r R 10 can together form a polyether; 

or wherein any two of R r R I0 can, together with the intervening carbon atoms of the 
anthracene core, form a crown ether. 
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As used herein, "alkyl" refers to a hydrocarbon containing from 1 to about 20 carbon 
atoms. Alkyl groups may straight, branched, cyclic, or combinations thereof. Alkyl groups 
thus include, by way of illustration only, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, 
cyclopentyl, cyclopentylmethyl, cyclohexyl, cyclohexylmethyl, and the like. Also included 
5 within the definition of "alkyl" are fused and/or polycyclic aliphatic cyclic ring systems such 
as, for example, adamantane. As used herein the term "alkenyl" denotes an alkyl group 
having at least one carbon-carbon double bond. As used herein the term "alkynyl" denotes 
an alkyl group having at least one carbon-carbon triple bond. 

In some preferred embodiments, the alkyl, alkenyl, alkynyl, aryl, aryloxy, and 
10 heteroaryl substituent groups described above may bear one or more further substituent 
groups; that is, they may be "substituted". In some preferred embodiments these substituent 
groups can include halogens (for example fluorine, chlorine, bromine and iodine), CN, N0 2 , 
lower alkyl groups, aryl groups, heteroaryl groups, aralkyl groups, aralkyloxy groups, 
guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino groups. In addition, the 
alkyl and aryl portions of aralkyloxy, arylalkyl, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, 
and aryloxycarbonyl groups also can bear such substituent groups. Thus, by way of example 
only, substituted alkyl groups include, for example, alkyl groups fluoro-, chloro-, bromo- and 
iodoalkyl groups, aniinoalkyl groups, and hydroxyalkyl groups, such as hydroxymethyl, 
hydroxyethyl, hydroxypropyl, hydroxybutyl, and the like. In some preferred embodiments^ 
20 such hydroxyalkyl groups contain from 1 to about 20 carbons. 

As used herein the term "aryl" means a group having 5 to about 20 carbon atoms and 
which contains at least one aromatic ring, such as phenyl, biphenyl and naphthyl. Preferred 
aryl groups include unsubstituted or substituted phenyl and naphthyl groups. The term 
"aryloxy" denotes an aryl group that is bound through an oxygen atom, for example a phenoxy 
25 group. 

In general, the prefix "hetero" denotes the presence of at least one hetero (i.e., non- 
carbon) atom, which is in some preferred embodiments independently one to three O, N, S, 
P, Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more 
ring carbon atom is replaced by such a heteroatom. Preferred heteroaryl groups include 
pyridyl, pyrimidyl, pyrrolyl, furyl, thienyl, and imidazolyl groups. 

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups 
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In general, the prefix "hetero" denotes the presence of at least one hetero (i.e., non- 
carbon) atom, which is in some preferred embodiments independently one to three O, N, S, 
P, Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more 
ring carbon atom is replaced by such a heteroatom. Preferred heteroaryl groups include 
5 pyridyl, pyrimidyl, pyrrolyl, furyl, thienyl, and imidazolyl groups. 

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups 
include benzyl, phenethyl, benzhydryl and naphthylmethyl groups. 

The term "alkylaryl" (or "alkaryF 5 ) is intended to denote a group having from 6 to 15 
10 carbons, consisting of an aryl group that bears an alkyl group. Examples of aralkyl groups 
include methylphenyl, ethylphenyl and methylnaphthyl groups. 

The term "arylsulfonyl" denotes an aryl group attached through a sulfonyl group, for 
example phenylsulfonyl. The term "alkylsulfonyl" denotes an alkyl group attached through 
a sulfonyl group, for example methylsulfonyl. 
15 The term "alkoxycarbonyl" denotes a group of formula -C(=0)-0-R where R is alkyl, 

alkenyl, or alkynyl, where the alkyl, alkenyl, or alkynyl portions thereof can be optionally 
substituted as described herein. 

The term "aiyloxycarbonyl" denotes a group of formula -C(=0)-0-R where R is aryl, 
where the aryl portion thereof can be optionally substituted as described herein. 
20 The terms "arylalkyloxy" or "aralkyloxy" are equivalent, and denote a group of 

formula -O-R'-R" where R' is R is alkyl, alkenyl, or alkynyl which can be optionally 
substituted as described herein, and wherein R" denotes a aryl or substituted aryl group. 

The terms "alkylaryloxy" or "alkaryloxy" are equivalent, and denote a group of 
formula -O-R'-R", where R ; is an aryl or substituted aryl group, and R /; is alkyl, alkenyl, or 
25 alkynyl which can be optionally substituted as described herein. 

As used herein, the term "aldehyde group" denotes a group that bears a moiety of 
formula -C(=0)-H. The term "ketone" denotes a moiety containing a group of formula -R- 
C(=0)-R=, where R and R= are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, 
or alkaryl, each of which may be substituted as described herein. 
30 As used herein, the term "ester" denotes a moiety having a group of formula -R- 

C(=0)-0-R= or -R-0-C(=0)-R= where R and R= are independently alkyl, alkenyl, alkynyl, 
aryl, heteroaryl, aralkyl, or alkaryl, each of which may be substituted as described herein. 
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The term "ether" denotes a moiety having a group of formula -R-OR= or where R and 
R= are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, or alkaryl, each of 
which may be substituted as described herein. 

The term "crown ether" has its usual meaning of a cyclic ether containing several 
5 oxygen atoms. As used herein the term "organosulfur compound" denotes aliphatic or 
aromatic sulfur containing compounds, for example thiols and disulfides. The term 
"organometallic group" denotes an organic molecule containing at least one metal atom. 

The term "organosilicon compound" denotes aliphatic or aromatic silicon containing 
compounds, for example alkyl and aryl silanes. 

10 The term "carboxylic acid" denotes a moiety having a carboxyl group, other than an 

amino acid. 

As used herein, the term "amino acid" denotes a molecule containing both an amino 
group and a carboxyl group. In some preferred embodiments, the amino acids are a-, P-, y- 
or 5-amino acids, including their stereoisomers and racemates. As used herein the term "L- 
15 amino acid" denotes an a-amino acid having the L configuration around the a-carbon, that is, 
a carboxylic acid of general formula CHCCOOHXNHXside chain), having the L- 
configuration. The term "D-amino acid" similarly denotes a carboxylic acid of general 
formula CH(COOH)(NH2)-(side chain), having the D-configuration around the a-carbon. Side 
chains of L-amino acids include naturally occurring and non-naturally occurring moieties. 
20 Non-naturally occurring (i.e., unnatural) arnino acid side chains are moieties that are used in 
place of naturally occurring amino acid side chains in, for example, amino acid analogs. See, 
for example, Lehninger, BiochemisUy, Second Edition, Worth Publishers, Inc, 1975, pages 
72-77, incorporated herein by reference. Amino acid substituents may be attached through 
their carbonyl groups through the oxygen or carbonyl carbon thereof, or through their amino 
25 groups, or through functionalities residing on their side chain portions. 

As used herein "polynucleotide" refers to a nucleic acid molecule and includes 
genomic DNA cDNA, RNA, mRNA and the like. 

As used herein "antisense oligonucleotide" refers to a nucleic acid molecule that is 
complementary to at least a portion of a target nucleotide sequence of interest and specifically 
30 hybridizes to the target nucleotide sequence under physiological conditions. 

As used herein "inhibitor of mismatch repair" refers to an agent that interferes with at 
least one function of the mismatch repair system of a cell and thereby renders the cell more 
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susceptible to mutation. 

As used herein "hypermutable" refers to a state in which a cell in vitro or in vivo is 
made more susceptible to mutation through a loss or impairment of the mismatch repair 
system. 

5 As used herein "agents," "chemicals," and "inhibitors" when used in connection 

with inhibition of MMR refers to chemicals, oligonucleotides, analogs of natural 
substrates, and the like that interfere with normal function of MMR. 

Methods for developing hypermutable cells and whole organisms have been 
discovered by taking advantage of the conserved mismatch repair (MMR) process of a 

10 host. Dominant negative alleles of MMR genes, when introduced into cells or transgenic 
animals, increase the rate of spontaneous mutations by reducing the effectiveness of DNA 
repair and thereby render the cells or animals hypermutable. Hypermutable microbes, 
protozoans, insects, mammalian cells, plants or whole animals can then be utilized to 
develop new mutations in a gene of interest. It has been discovered that chemicals that 

15 block MMR, and thereby render cells hypermutable, is an efficient way to introduce 

mutations in cells and genes of interest. In addition to destabilizing the genome of cells 
exposed to chemicals that inhibit MMR activity may be done transiently, allowing cells to 
become hypermutable, and removing the chemical exposure after the desired effect (e.g., a 
mutation in a gene of interest) is achieved. The chemicals that inhibit MMR activity that 

20 are suitable for use in the invention include, but are not limited to, anthracene derivatives, 
nonhydrolyzable ATP analogs, ATPase inhibitors, antisense oligonucleotides that 
specifically anneal to polynucleotides encoding mismatch repair proteins, DNA 
polymerase inhibitors, and exonuclease inhibitors. These chemicals can enhance the rate 
of mutation due to inactivation of MMR yielding clones or subtypes with altered 

25 biochemical properties. Methods for identifying chemical compounds that inhibit MMR 
in vivo are also described herein. 

The process of MMR, also called mismatch proofreading, is carried out by a group 
of protein complexes in cells ranging from bacteria to man (Harfe B.D., and S. Jinks- 
Robertson (2000) An. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 

30 266:1959-1960). An MMR gene is a gene that encodes for one of the proteins of such a 
mismatch repair complex. Although not wanting to be bound by any particular theory of 
mechanism of action, an MMR complex is believed to detect distortions of the DNA helix 
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resulting from non-complementary pairing of nucleotide bases. The non-complementary 
base on the newer DNA strand is excised, and the excised base is replaced with the 
appropriate base, which is complementary to the older DNA strand. In this way, cells 
eliminate many mutations that , ocur as a result of mistakes in DNA replication. 
5 Dominant negative alleles cause an MMR defective phenotype even in the presence 

of a wild-type allele in the same cell. An example of a dominant negative allele of an 
MMR gene is the human gene hPMS2-134 (SEQ ID NO:25), which carries a truncating 
mutation at codon 134 (Nicolaides, N.C. etal. (1998) Mol. Cell. Biol 18:1635-1641). The 
mutation causes the product of this gene to abnormally terminate at the position of the 
10 1 34th amino acid, resulting in a shortened polypeptide containing the N-terminal 133 

amino acids (SEQ ID NO:24). Such a mutation causes an increase in the rate of mutations, 
which accumulate in cells after DNA replication. Expression of a dominant negative allele 
of a mismatch repair gene results in impairment of mismatch repair activity, even in the 
presence of the wild-type allele. 
15 The MMR process has been shown to be blocked by the use of nonhydrolyzable 

forms of ATP (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; Allen, DJ. et al. 
(1997) EMBOJ. 16:4467-4476; Bjomson, K.P. etal. (2000) Biochem. 39:3176-3183). 
However, it has not been demonstrated that chemicals can block MMR activity in cells. 
Such chemicals can be identified by screening cells for defective MMR activity. Cells 
20 from bacteria, yeast, fungi, insects, plants, animals, and humans can be screened for 
defective mismatch repair. Genomic DNA, cDNA, or rnRNA from any cell can be 
analyzed for variations from the wild type sequences in cells or organisms grown in the 
presence of MMR blocking compounds. Various techniques of screening can be used. 
The suitability of such screening assays, whether natural or artificial, for use in identifying 
25 hypermutable cells, insects, fungi, plants or animals can be evaluated by testing the 

mismatch repair activity caused by a compound or a mixture of compounds, to determine 
if it is an MMR inhibitor. 

A cell, a microbe, or a whole organism such as an insect, fungus, plant or animal in 
which a chemical inhibitor of mismatch repair has been treated will become hypermutable. 
This means that the spontaneous mutation rate of such cells or whole organism is elevated 
compared to cells or animals without such treatment. The degree of elevation of the 
spontaneous mutation rate can be at least 2-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100- 
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fold, 200-fold, 500-fold, or 1000-fold that of the normal cell or animal. The use of 
chemical mutagens such as, but limited to, N-methyl-N , -nitro-N-nitrosoguanidine 
(MNNG), methane sulfonate, dimethyl sulfonate, 06-methyl benzadine, ethyl 
methanesulfonate (EMS), methylnitrosourea (MNU), ethylnitrosourea (ENC), etc. can be 

5 used in MMR defective cells or whole organisms to increase the rates an additional 10 to 
100 fold that of the MMR deficiency itself. 

According to one aspect of the invention, a screening assay for identifying 
chemical inhibitors of MMR is developed and employed. A chemical compound can be in 
any form or class ranging from but not limited to amino acid, steroidal, aromatic, or lipid 

10 precursors. The chemical compound can be naturally occurring or made in the laboratory. 
The screening assay can be natural such as looking for altered endogenous repeats within 
an host organism's genome (as demonstrated in Figs. 4 and 5), or made in the laboratory 
using an MMR-sensitive reporter gene as demonstrated in Figs. 1-3). 

The chemical compound can be introduced into the cell by supplementing the 

15 growth medium, or by intracellular delivery such as but not limited to using microinjection 
or carrier compounds. 

According to another aspect of the invention, a chemical compound from the 
anthracene class can be exposed to MMR proficient cells or whole organism hosts, the host 
is grown and screened for subtypes containing genetically altered genes with new 

20 biochemical features. 

The anthracene compounds that are suitable for use in the invention include, but 
are not limited to anthracenes having the formula: 




wherein Rj-Rjo are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
25 substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S -alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl s O-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaiyl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aiyloxycarbonyl, guanidino, carboxy, an alcohol, 
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an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

5 wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 

heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aiyl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
10 aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of R r R 10 can together form a polyether; 
or wherein any two of R r R 10 can, together with the intervening carbon atoms of the 
15 anthracene core, form a crown ether. 

The method of the invention also encompasses inhibiting MMR with an anthracene 
of the above formula wherein Rs and R^ are hydrogen, and the remaining substituents are as 
described above. 

The some embodiments, in the anthracene compound R^R^ are independently 
20 hydrogen, hydroxyl, alkyl, aryl, arylaklyl, or hydroxyalkyl. In other embodiments, R r R I0 are 

independently hydrogen, hydroxyl, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, 

tolyl, hydroxymethyl, hydroxypropyl, or hydroxybutyl. 

In specific embodiments of the invention the anthracenes include, but are not 

limited to 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 
25 9,10-diphenylanthracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl- 10- 

methylanthracene, dimethylanthracene- 1 ,2-diol, 9-hydroxymethyl- 1 0-methylanthracene- 

1,2-diol, 9-hydroxymethyl- 10-methylanthracene-3,4-diol, 9, 10-di-m-tolyanthracene,.and 

the like. 

The chiral position of the side chains of the anthracenes is not particularly limited 
30 and may be any chiral position and any chiral analog. The anthracenes may also comprise 
a stereoisomer^ forms of the anthracenes and includes any isomeric analog. 
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Examples of hosts are but not limited to cells or whole organisms from human, 
primate, mammal, rodent, plant, fish, reptiles, amphibians, insects, fungi, yeast or 
microbes of prokaryotic origin. 

Yet another aspect of the invention is the use of ATP analogs capable of Mocking 
5 ATPase activity required for MMR. MMR reporter cells are screened with ATP 
compound libraries to identify those compounds capable of blocking MMR in vivo. 
Examples of ATP analogs that are useful in blocking MMR activity include, but are not 
limited to, nonhydrolyzable forms of ATP such as AMP-PNP and ATP [gamma] S block 
the MMR activity (Galio, L. et al (1999) Nucl Acids Res. 27:2325-2331; Allen, D J. et al 

10 (1997) EMBOJ. 16:4467-4476; Bjomson K.P. et al (2000) Biochem. 39:3176-3183). 

Yet another aspect of the invention is the use of nuclease inhibitors that are able to 
block the exonuclease activity of the MMR biochemical pathway. MMR reporter cells are 
screened with nuclease inhibitor compound libraries to identify compounds capable of 
blocking MMR in vivo. Examples of nuclease inhibitors that are useful in blocking MMR 

15 activity include, but are not limited to analogs of N-Ethylmaleimide, an endonuclease 
inhibitor (Huang, Y.C., etal. (1995) Arch. Biochem. Biophys. 316:485), heterodimeric 
adenine-chain-acridine compounds, exonulcease HI inhibitors (Belmont P, et.aL, Bioorg 
Med Chem Lett (2000) 10:293-295), as well as antibiotic compounds such as 
Heliquinomycin, which have helicase inhibitory activity (Chino, M, et.al. J. Antibiot. 

20 (Tokyo) (1998) 51:480-486). 

Another aspect of the invention is the use of DNA polymerase inhibitors that are 
able to block the polymerization required for mismatch-mediated repair. MMR reporter 
cells are screened with DNA polymerase inhibitor compound libraries to identify those 
compounds capable of blocking MMR in vivo. Examples of DNA polymerase inhibitors 

25 that are useful in blocking MMR activity include, but are not limited to, analogs of 
actinomycinD (Martin, S.J., et.al. (1990) J. Immunol 145:1859), Aphidicolin 
(Kuwakado, K. etal. (1993) Biochem. Pharmacol 46:1909) l-(2 t -Deoxy-2 f -fluoro-beta-L- 
arabinofuranosyl)-5-methyluracil (L-FMAU) (Kukhanova M, et.al., Biochem Pharmacol 
(1998) 55:1 181-1 187), and 2 , ,3 , -dideoxyribonucleoside 5*-triphosphates (ddNTPs) (Ono, 

30 K., et.aL, Biomed Pharmacother (1984) 38:382-389). 

In yet another aspect of the invention, antisense oligonucleotides are administered 
to cells to disrupt at least one function of the mismatch repair process. The antisense 
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polynucleotides hybridize to MMR polynucleotides. Both full-length and antisense 
polynucleotide frgaments are suitable for use. "Antisense polynucleotide fragments" of 
the invention include, but are not limited to polynuclotides that specifically hybridize to an 
MMR encoding RNA (as determined by ,-equence comparison of nucleotides encoding the 
5 MMR to nucleotides encoding other known molecules). Identification of sequences that 
are substantially unique to MMR-encoding polynucleotides can be ascertained by analysis 
of any publicly available sequence database and/or with any commercially available 
• sequence comparison programs. Antisense molecules may be generated by any means 
including, but not limited to chemical synthesis, expression in an in vitro transcription 
10 reaction, through expression in a transformed cell comprising a vector that may be 

transcribed to produce antisense molecules, through restriction digestion and isolation, 
through the polymerase chain reaction, and the like. 

Antisense oligonucleotides, or fragments thereof may include the nucleotide 
sequences set forth in SEQ ID NOs:15, 17, 19, 21, 23, 25, 27, and 29 or sequences 
15 complementary or homologous thereto, for example. Those of skill in the art recognize 
that the invention may be predicted using any MMR gene. Specifically, antisense nucleic 
acid molecules comprise a sequence complementary to at least about 10, 15, 25, 50, 100, 
250 or 500 nucleotides or an entire MMR encoding sequence. Preferably, the antisense 
oligonucleotides comprise a sequence complementary to about 15 consecutive nucleotides 
20 of the coding strand of the MMR encoding sequence. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an MMR protein. The 
coding strand may also include regulatory regions of the MMR sequence. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which 
25 are translated into amino acid residues (e.g., the protein coding region of human PMS2 
corresponds to the coding region SEQ ID NO: 17). In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding an MMR protein. The term "noncoding region" refers to 5' 
and 3' sequences which flank the coding region that are not translated into amino acids 
30 (i.e., also referred to as 5' and 3' untranslated regions (UTR)). 

Preferably, antisense oligonucleotides are directed to regulatory regions of a 
nucleotide sequence encoding an MMR protein, or mRNA corresponding thereto, 
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including, but not limited to, the initiation codon, TATA box, enhancer sequences, and the 
like. Given the coding strand sequences provided herein, antisense nucleic acids of the 
invention can be designed according to the rules of Watson and Crick or Hoogsteen base 
pairing. The antisense nucleic acid molecule can be complementary to the entire cod.ng 
5 region of an MMR mRNA, but more preferably is an oligonucleotide that is antisense to 
only a portion of the coding or noncoding region of an MMR mRNA. For example, the 
antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of an MMR mRNA. An antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. 

10 Screening is any process whereby a chemical compound is exposed to a cell or 

whole organism. The process of screening can be carried out using but not limited to a 
whole animal, plant, insect, microbe, or by using a suspension of one or more isolated cells 
in culture. The cell can be any type of eukaryotic or prokaryotic cell, including, for 
example, cells isolated from humans or other primates, mammals or other vertebrates, 

15 invertebrates, and single celled organisms such as protozoa, yeast, or bacteria. 

In general, screening will be carried out using a suspension of cells, or a single cell, 
but other methods can also be applied as long as a sufficient fraction of the treated cells or 
tissue is exposed so that isolated cells can be grown and utilized. Techniques for chemical 
screening are well known to those in the art. Available techniques for screening include 

20 cell-based assays, molecular assays, and whole organism-based assays. Compounds can 
be added to the screening assays of the invention in order to identify those agents that are 
capable of blocking MMR in cells. 

The screening assays of the invention provide a system wherein a cell, cells or a 
whole organism is contacted with a candidate compound and then tested to determine 

25 whether mismatch repair has been adversely affected. The method in which MMR is 
analyzed may be any known method, including, but not limited to analysis of the 
molecular sequence of the MMR gene, and analyzing endogenous repeats in the subject's 
genome. Further, the invention provides a convenient assay to analyze the effects of 
candidate agents on reporter genes transfected into cells. 

30 MMR-inhibitors identified by the methods of the invention can be used to generate 

new mutations in one or more gene(s) of interest. A gene of interest can be any gene 
naturally possessed by a cell line, microbe or whole organism. An advantage of using 
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chemicals rather than recombinant technologies to block MMR are that the process is 
faster; there is no need to produce stable clones with a knocked out MMR gene or a clone 
expressing a dominant negative MMR gene allele. Another advantage is that host 
organisms need not be screened for integrated knock out targeting vectors or stable 
5 expression of a dominant negative MMR gene allele. Finally, once a cell, plant or animal 
has been exposed to the MMR-blocking compound and a new output trait is generated, the 
MMR process can be restored by removal of compound. Mutations can be detected by 
analyzing the genotype of the cell, or whole organism, for example, by examining the 
sequence of genomic DNA, cDNA, messenger RNA, or amino acids associated with the 
10 gene of interest. Mutations can also be detected by screening for new output traits such as 
hvpoxmthine-guanine phosphoribosyltransferase (HPRT) revertants. A mutant 
polypeptide can be detected by identifying alterations in electrophoretic mobility, 
spectroscopic properties, or other physical or structural characteristics of a protein encoded 
by a mutant gene. One can also screen for altered function of the protein in situ, in isolated 
form, or in model systems. One can screen for alteration of any property of the cell, plant 
or animal associated with the function of the gene of interest. 

Several advantages exist in generating genetic mutations by blocking MMR in vivo 
in contrast to general DNA damaging agents such as MNNG, MNU and EMS. Cells with 
MMR deficiency have a wide range of mutations dispersed throughout their entire genome 
in contrast to DNA damaging agents such as MNNG, MNU, EMS and ionizing radiation. 
Another advantage is that mutant cells that arise from MMR deficiency are diploid in 
nature and do not lose large segments of chromosomes as is the case of DNA damaging 
agents such as EMS, MNU, and ionizing radiation (Honma, M. et al. (1997) Mutat. Res. 
374:89-98). This unique feature allows for subtle changes throughout a host's genome that 
leads to subtle genetic changes yielding genetically stable hosts with commercially 
important output traits. 

The invention also encompasses blocking MMR in vivo and in vitro and further 
exposing the cells or organisms to a chemical mutagen in order to increase the incidence of 
genetic mutation. 

The invention also encompasses withdrawing exposure to inhibitors of mismatch 
repair once a desired mutant genotype or phenotype is generated such that the mutations 
are thereafter maintained in a stable genome. 
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The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples, which are 
provided herein for purposes of illustration only, and are not intended to limit the scope of 
the invention. 

5 EXAMPLES 

EXAMPLE 1: Generation of a cell-based screening assay to identify chemicals 
capable of inactivating mismatch repair in vivo. 

A hallmark of MMR deficiency is the generation of unstable microsatellite repeats 
in the genome of host cells (Peinado, M.A. et al (1992) Proc. Natl Acad. ScL USA 

10 89:10065-10069; Strand, M. et al (1993) Nature 365:274-276; Parsons, R. et al (1993) 

Cell 75:1227-1236). This phenotype is referred to as microsatellite instability (MI) (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann, Rev. Genet. 34:359-399; Modrich, P. (1994) 
Science 266:1959-1960; Peinado, M.A. et al (1992) Proc. Natl Acad. ScL USA 89:10065- 
10069; Perucho, M. (1996) Biol Chem. 377:675-684; Hoang, J.M. et al (1997) Cancer 

15 Res. 57:300-303; Strand, M. et a/.(1993) Nature 365:274-276). MI consists of deletions 
and/or insertions within repetitive mono-, di- and/or tri nucleotide repetitive sequences 
throughout the entire genome of a host cell. Extensive genetic analysis of eukaryotic cells 
have found that the only biochemical defect that is capable of producing MI is defective 
MMR (Harfe, B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, 

20 P. (1994) Science 266:1959-1960; Peinado, M.A. et al (1992) Proc. Natl Acad. ScL USA 
89:10065-10069; Perucho, M. (1996) Biol Chem. 377:675-684; Hoang, J.M. et al (1997) 
Cancer Res. 57:300-303; Strand, M. et a/.(1993) Nature 365:274-276). In light of this 
unique feature that defective MMR has on promoting microsatellite instability, 
endogenous MI is now used as a biochemical marker to survey for lack of MMR activity 

25 within host cells (Hoang, J.M. et al (1997) Cancer Res. 57:300-303). 

A method used to detect MMR deficiency in eukaryotic cells is to employ a 
reporter gene that has a polynucleotide repeat inserted within the coding region that 
disrupts its reading frame due to a frame shift. In the case where MMR is defective, the 
reporter gene will acquire random mutations (i.e., insertions and/or deletions) within the 

30 polynucelotide repeat yielding clones that contain a reporter with an open reading frame. 
This reporter gene can be of any biochemical pathway such as but not limited to p- 
glucoronidase, p-galactosidase, neomycin resistant gene, hygromycin resistance gene, 
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green fluorescent protein, and the like. A schematic diagram of MMR-sensitive reporters 
are shown in Fig. 1, where the polynucleotide repeat can consist of mono-, di-, tri- or tetra- 
nucleotides. We have employed the use of a p-galactosidase MMR-sensitive reporter gene 
to measure for MMR activity in H36 cells, which ^ a murine hyhridoma cell line. The 
reporter construct used is called pCAR-OF, which contains a hygromycin resistance 
(HYG) gene plus a p-galactosidase gene with a 29 bp out-of-frame poly-CA tract inserted 
at the 5' end of its coding region. The pCAR-OF reporter cannot generate p-galactosidase 
activity unless a frame-restoring mutation (i.e., insertion or deletion) arises following 
transfection. This line has been shown to be sensitive to inactivated MMR where using a 
dominant negative MMR gene allele has found this condition to result in the production of 
P-galactosidase (unpublished data). An example of these data using the dominant 
negative PMS134 allele is shown in Table 1. Briefly, H36 cells were each transfected with 
an expression vector containing the PMS134 allele (referred to as HB134) or empty vector 
and the pCAR-OF vector in duplicate reactions using the protocol below. The PMS134 
15 gene is cloned into the pEF expression vector, which contains the elongation factor 

promoter upstream of the cloning site followed by a mammalian polyadenylation signal. 
This vector also contains the NEOr gene that allows for selection of cells in G418 to 
identify those retaining this plasmid. Briefly, cells were transfected with 1 ug of the 
PMS134 or empty vector using polyliposomes following the manufacturer's protocol (Life 
20 Technologies). Cells were then selected in 0.5 mg/ml of G418 for 10 days and G418 

resistant cells were pooled together to analyze for gene expression. PMS134 positive cells, 
which were determined by RT-PCR and western blot (not shown) were expanded and 
transfected with the pCAR-OF reporter gene that contains a hygromycin (HYG) resistance 
gene as reporter using the protocol described above. Cells were selected in 0.5 mg/ml 
25 G41 8 and 0.5mg/ml HYG to select for cells retaining both the MMR effector and the 

pCAR-OF reporter plasmids. All cultures transfected with the pCAR vector resulted in a 
similar number of HYG/G418 resistant cells. Cultures were then expanded and tested for 
P-galactosidase activity in situ as well as by biochemical analysis of cell extracts. For in 
situ analysis, 100,000 cells were harvested and fixed in 1% gluteraldehyde, washed in 
phosphate buffered saline solution and incubated in 1 ml of X-gal substrate solution [0.15 
M NaCl, 1 mM MgCl 2 , 3.3 mM K 4 Fe(CN) 6 , 3.3 mM K 3 Fe(CN) 6 , 0.2% X-Gal ] in 24 well 
plates for 2 hours at 37°C. Reactions were stopped in 500 mM sodium bicarbonate 
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solution and transferred to microscope slides for analysis. Three fields of 200 cells each 
were counted for blue (p-galactosidase positive cells) or white (P-galactosidase negative 
cells) to assess for MMR inactivation. Table 1 shows the results from these studies. 
While no p-galactosidase positive cells were observed in H36 empty vector cells and 10% 
5 of the cells per field were p-galactosidase positive in HB 134 cultures. 

Table 1. p-galactosidase expression of H36 empty vector and HB134 cells transfected 
with pCAR-OF reporter vectors. Cells were transfected with the pCAR-OF reporter 
plasmid. Transfected cells were selected in HYG and G418, expanded and stained with 
10 X-gal solution to measure for p-galactosidase activity (blue colored cells). 3 fields of 200 
cells each were analyzed by microscopy. The results below represent the mean +/- 
standard deviation of these experiments. 



Table 1. 



CELL LINE 


# BLUE CELLS 


H36 empty vector 


0+/-0 


HB134 


20 +/- 3 



15 

Cultures can be further analyzed by biochemical assays using cell extracts to measure p- 
galactosidase activity as previously described (Nicolaides, N.C. et al (1998) Mol. Cell 
Biol 18:1635-1641). 

The data described in Table 1 show that by inhibiting the MMR activity of an 
20 MMR proficient cell host can result in MI and the altering of microsatellites in the pCAR- 
OF vector results in cells that produce functional P-galactosidase enzyme. The use of the 
H36pCAR-OF cell line can now be used to screen for chemicals that are able to block 
MMR of the H36 cell line. 



25 EXAMPLE 2: Screening assays for identifying chemical blockers of MMR. 

A method for screening chemical libraries is provided in this example using the 
H36pCAR-OF cell line described in Example 1. This cell line is a hardy, stable line that 
can be formatted into 96-well microliter plates for automated screening for chemicals that 
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specifically block MMR. An overview of the screening process is given in Figure 2, 
however, the process is not limited to the specifications within this example. Briefly, 
10,000 cells in a total volume of 0.1ml of growth medium (RPMI1640 plus 10% fetal 
bovine serum) are added to 96-well microtiter plates cor, aining any variety of chemical 
5 compounds. Cells are grown for 14-17 days at 37°C in 5%C0 2 . Cells are then lysed in the 
growth medium with 50uls of lysis buffer containing 0.1 MTris buffer (pH S.0), 0.1% 
Triton X-100, 45 mM 2-mercaptoethanol, ImM MgCl 2 , 0.1 M NaP0 4 and 0.6 mg/ml 
Chlorophenol-red- p-D-galactopyranoside (CPRG, Roche). Reactions are incubated for 1 
hour, terminated by the addition of 50 uls of 0.5 M Na^, and analyzed by 

10 spectrophotometry at 576 nm. 

Experimental wells are compared to untreated or vehicle treated wells to identify 
those with increased P-galactosidase activity. Compounds producing MMR blocking 
activity are then further analyzed using different cell lines containing the pCAR-OF 
plasmid to measure the ability to block MMR as determined by MI in MMR proficient 

15 hosts by analyzing endogenous microsatellites for instability using assays described below. 

EXAMPLE 3: Defining MMR blocking chemicals. 

The identification of chemical inhibitors of MMR can be difficult in deternuning 
those that are standard mutagens from those that induce genomic instability via the 

20 blockade of MMR. This Example teaches of a method for determining blockers of MMR 
from more general mutagens. Once a compound has been identified in the assay 
described above, one can determine if the compound is a general mutagen or a speific 
MMR blocker by monitoring mutation rates in MMR proficient cells and a controlled 
subclone that is MMR defective. One feature of MMR deficiency is the increased 

25 resistance to toxicity of DNA alkylating agents that allows for enhanced rates of mutations 
upon mutagen exposure (Liu, L., et.al. Cancel- Res (1996) 56:5375-5379). This unique 
feature allows for the use of a MMR proficient cell and a controlled line to measure for 
enhanced activity of a chemical compound to induce mutations in MMR proficient vs 
MMR deficient lines. If the compound is a true inhibitor of MMR then genetic mutations 

30 should occur in MMR proficient cells while no "enhanced " mutation rate will be found in 
already MMR defective cells. Using these criteria chemicals such as ICR191, which 
induces frameshift mutations in mammalian cells would not be considered a MMR 
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reporter construct used is called pCAR-OF, which contains a hygromycin resistance 
(HYG) gene plus a p-galactosidase gene with a 29 bp out-of-frame poly-CA tract inserted 
at the y end of its coding region. The pCAR-OF reporter cannot generate p-galactosidase 
activity unless a frame-restoring mutation (i.e., insertion or deletion) arises following 

5 transfection. This line has been shown to be sensitive to inactivated MMR where using a 
dominant negative MMR gene allele has found this condition to result in the production of 
P-galactosidase (unpublished data). An example of these data using the dominant 
negative PMS134 allele is shown in Table 1. Briefly, H36 cells were each transfected with 
an expression vector containing the PMS134 allele (referred to as HB134) or empty vector 

10 and the pCAR-OF vector in duplicate reactions using the protocol below. The PMS134 
gene is cloned into the pEF expression vector, which contains the elongation factor 
promoter upstream of the cloning site followed by a mammalian polyadenylation signal. 
This vector also contains the NEOr gene that allows for selection of cells in G418 to 
identify those retaining this plasmid. Briefly, cells were transfected with 1 jxg of the 

15 PMS134 or empty vector using polyliposomes following the manufacturer's protocol (Life 
Technologies). Cells were then selected in 0.5 mg/ml of G418 for 10 days and G418 
resistant cells were pooled together to analyze for gene expression. PMS134 positive cells, 
which were determined by RT-PCR and western blot (not shown) were expanded and 
transfected with the pCAR-OF reporter gene that contains a hygromycin (HYG) resistance 

20 gene as reporter using the protocol described above. Cells were selected in 0.5 mg/ml 
G418 and 0.5mg/ml HYG to select for cells retaining both the MMR effector and the 
pCAR-OF reporter plasmids. All cultures transfected with the pCAR vector resulted in a 
similar number of HYG/G418 resistant cells. Cultures were then expanded and tested for 
P-galactosidase activity in situ as well as by biochemical analysis of cell extracts. For in 

25 situ analysis, 100,000 cells were harvested and fixed in 1% gluteraldehyde, washed in 

phosphate buffered saline solution and incubated in 1 ml of X-gal substrate solution [0.15 
M NaCl, 1 mM MgCl 2 , 3.3 mM K 4 Fe(CN) 6 , 3.3 mM K 3 Fe(CN) 6 , 0.2% X-Gal ] in 24 well 
plates for 2 hours at 37°C. Reactions were stopped in 500 mM sodium bicarbonate 
solution and transferred to microscope slides for analysis. Three fields of 200 cells each 

30 were counted for blue (p-galactosidase positive cells) or white (P-galactosidase negative 
cells) to assess for MMR inactivation. Table 1 shows the results from these studies. 
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blocking compound because of its ability to produce enhanced mutation rates in already 
MMR defective cell lines (Chen, W.D., et.al. J Natl Cancer Inst. (2000) 92:480-485). 
These screening lines include the but are not limited those in which a dominant negative 
MMR gene has been introduced such as that described in EX/ MPLE 1 or those in which 
5 naturally MMR deficient cells such as HCT1 16 has been cured by introduction of a 
complementing MMR gene as described (Chen, W.D., et.al. J Natl Cancer Inst. (2000) 
92:480-485). 

EXAMPLE 4: Identification of chemical inhibitors of MMR in vivo. 

10 MMR is a conserved post replicative DNA repair mechanism that repairs point 

mutations and insertion/deletions in repetitive sequences after cell division. The MMR 
requires an ATPase activity for initiation complex recognition and DNA translocation. In 
vitro assays have shown that the use of nonhydrolyzable forms of ATP such as AMP-PNP 
and ATP[gamma]S block the MMR activity (Galio, L. et al. (1999) Nucl. Acids Res. 

15 27:2325-2331; Allen, D.J. et al. (1997) EMBOJ. 16:4467-4476; BjomsonK.P. et al 
(2000) Biochem. 39:3176-3183). 

The use of chemicals to inhibit endogenous MMR in vivo has not been 
distinguished in the public domain. In an attempt to identify chemicals that can inhibit 
MMR in vivo, we used our H36pCAR-OF screening assay to screen for chemicals that are 

20 able to cause microsatellite instability and restoration of p-galactosidase activity from the 
pCAR-OF vector, an effect that can only be caused due to MMR deficiency. In our 
screening assays we used a variety of classes of compounds ranging from steroids such as 
pontasterone to potent alkylating agents such as EMS, to kinase and other enzyme 
inhibitors. Screens identified one class of chemicals that were capable of generating P- 

25 galactosidase positive cells. These molecules were derived from the anthracene class. An 
example of one such anthracene derivative for the purposes of this application is a 
molecule called 9,10-dimethylanthracene, referred to from here on as DMA. Fig. 3 shows 
the effect of DMA in shifting the pCAR-OF reporter plasmid. In contrast, general DNA 
alkylating agents such as EMS or MNNG did not result in MI and/or the shifting of the 

30 polynulceotide tract in the pCAR-OF reporter. 

The most likely explanation for the differences in p-galactosidase activity was that 
the DMA compound disturbed MMR activity, resulting in a higher frequency of mutation 
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within the pCAR-OF reporter and re-establishing the ORF. To directly test the hypothesis 
that MMR was altered, we employ a biochemical assay for MMR with the individual 
clones as described by Nicolaides et at., 1997 (Nicolaides, N.C. et al (1998) Mol Cell 
Biol 18:1635-1641). Nuclear extracts are prepared from the clones and incubated with 
5 heteroduplex substrates containing either a /CA\ insertion-deletion or a G/T mismatch 
under conditions described previously. The /CA\ and G/T heteroduplexes are used to test 
repair from the 3* and 5' directions, respectively as described (Nicolaides, N.C. et al 
(1998) Mol Cell Biol 18:1635-1641). 

10 Biochemical assays for mismatch repair. 

Enzymatic Repair Assays: 

MMR activity in nuclear extracts is performed as described, using 24 finol of 

substrate (Nicolaides, N.C. et al (1998) Mol Cell Biol 18:1635-1641). 

Complementation assays are done by adding - 100 ng of purified MutLa or MutSa 
15 components to 100 jig of nuclear extract, adjusting the final KC1 concentration to 100 mM 

(Nicolaides, N.C. etal (1998) Mol Cell Biol 18:1635-1641). The substrates used in 

these experiments contain a strand break 181 nucleotides 5' or 125 nucleotides 3' to the 

mismatch. 

20 Biochemical Activity Assays : 

To demonstrate the direct effect to small molecules on MMR proteins, molecular 
assays such as mismatch binding and MMR complex formation are performed in the 
presence or absence of drug. Briefly, MMR gene cDNAs are PCR amplified using primers 
encompassing the entire coding regions of the known MMR proteins MSH2 (SEQ ID 

25 NO:20), GTBP (SEQ ID NO:26) ? MLH1 (SEQ ID NO:22), human PMS2 (SEQ ID 

NO:16), mouse PMS2 (SEQ ID NO:14), PMS1 (SEQ ID NO:18), and MHS3 (SEQ ID 
NO:28) from any species with a sense primer containing a T7 promoter and a Kozak 
translation signal as previously described (Nicolaides, N.C. et al (1998) Mol Cell Biol 
18:1635-1641). The coding regions of known MMR proteins include the sequences shown 

30 in Table 3 for mouse PMS2 (SEQ ID NO:15), human PMS2 (SEQ ID NO:17), human 

PMS1 (SEQ ID NO:19), human MSH2 (SEQ ID NO:21), human MLH1 (SEQ ID NO:23), 
and human MSH3 (SEQ ID NO:29). Products are transcribed and translated using the 
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TNT system (Promega). An example of PCR primers and in vitro transcription- 
translation reactions are listed below. 

In vitro transcription-translation: 

Linear DNA fragments containing hPMS2 (SEQ ID NO:17) and hMLHl (SEQ ID 
NO:23) cDNA sequences were prepared by PCR, incorporating sequences for in vitro 
transcription and translation in the sense primer. A full-length liMLHl fragment was 
prepared using the sense primer 

S'-ggatcctaatacgactcactatagggagaccaccatgtcgttcgtggcaggg-S' (SEQ ID NO:l)(codons 1-6) 
and the antisense primer S'-taagtcttaagtgctaccaac-S' (SEQ ID NO:2)(located in the 3' 
untranslated region, nt 241 1-2433), using a wild-type hMLHl cDNA clone as template. A 
full-length hPMS2 fragment was prepared with the sense primer 

S'-ggatcctaatacgactcactatagggagaccaccatggaacaattgcctgcgg-S' (SEQ ID NO:3)(codons 1-6) 
and the antisense primer S'-aggttagtgaagactctgtc-S' (SEQ ID NO:4)(located in 3' 
untranslated region, nt 2670-2690) using a cloned hPMS2 cDNA as template. These 
fragments were used to produce proteins via the coupled transcription-translation system 
(Promega). The reactions were supplemented with 35 S-labelled methionine or unlabelled 
methionine. Lower molecular weight bands are presumed to be degradation products 
and/or polypeptides translated from alternative internal methionines. 

To study the effects of MMR inhibitors, assays are used to measure the formation 
of MLH1 and PMS2 with or without compound using polypeptides produced in the TNT 
System (Promega) followed by immunoprecipitation (IP). To facilitate the IP, tags may be 
placed at the C-tenninus of the PMS2 protein to use for antibody binding or antibodies 
directed to the MMR protein itself can be used for IP. 
Immunoprecipitations: 

Immunoprecipitations are performed on in vitro translated proteins by mixing the 
translation reactions with 1 ug of the MLH1 specific monoclonal antibody (mAB) MLH14 
(Oncogene Science, Inc.), a polyclonal antibody generated to codons 2-20 of hPMS2 
described above, or a polyclonal antibody generated to codons 843-862 of hPMS2 (Santa 
Cruz Biotechnology, Inc.) in 400 ul of EBC buffer (50 mM Tris, pH 7.5, 0. 1 M NaCl, 
0.5% NP40). After incubation for 1 hr at 4°C, protein A sepharose (Sigma) is added to a 
final concentration of 10% and reactions are incubated at 4°C for 1 hour. Proteins bound 
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to protein A are washed five times in EBC and separated by electrophoresis on 4-20% 
Tris-glycine gels, which are then dried and autoradiographed. 

Compounds that block heterodimerization of mutS or mutL proteins can now be 
identified using this assay. 

5 

EXAMPLE 5: Use of chemical MMR inhibitors yields microsatellite instability in 
human cells 

In order to demonstrate the global ability of a chemical inhibitor of MMR in host 
cells and organisms, we treated human HEK293 cells (referred to as 293 cells) with DMA 

10 and measured for microsatellite instability of endogenous loci using the BAT26 diagnostic 
marker (Hoang J.M. et al (1997) Cancer Res. 57:300-303). Briefly, 10 s cells were grown 
in control medium or 250 \xM DMA, a concentration that is found to be non-toxic, for 14 
to 17 days. Cells are then harvested and genomic DNA isolated using the salting out 
method (Nicolaides, N.C. et al (1991) Mol Cell Biol 11:6166-6176.). 

15 Various amounts of test DNAs from HCT1 16 (a human colon epithelial cell line) 

and 293 were first used to determine the sensitivity of our microsatellite test. The B AT26 
alleles are known to be heterogeneous between .these two cell lines and the products 
migrate at different molecular weights (Nicolaides personal observation). DNAs were 
diluted by limiting dilution to determine the level of sensitivity of the assay. DNAs were 

20 PCR amplified using the BAT26F: 5'-tgactacttttgacttcagcc-3' (SEQ ID NO:43) and the 
BAT26R: 5'-aaccattcaacatttttaaccc-3' (SEQ ID NO:44) primers in buffers as described 
(Nicolaides, N.C. et al (1995) Genomics 30:195-206). Briefly 1 pg to 100 ngs of DNA 
were amplified using the following conditions: 94°C for 30 sec, 58°C for 30 sec, 72°C for 
30 sec for 30 cycles. PCR reactions were electrophoresed on 12% polyacrylamide TBE 

25 gels (Novex) or 4% agarose gels and stained with ethidium bromide. These studies found 
that 0.1 ng of genomic DNA was the limit of detection using our conditions. 

To measure for microsatellite stability in 293 cells grown with or without DMA, 
0. 1 ngs of DNA from DMA-treated or control 293 cells were amplified using the reaction 
conditions above. Forty individual reactions were carried out for each sample to measure 

30 for minor alleles. Fig. 4A shows a typical result from these studies whereby BAT26 

alleles were amplified from DMA-treated and untreated cells and analyzed on 12% PAGE 
gels (Novex). Alleles from DMA-treated cells showed the presence of an altered allele 
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(asterisk) that migrated differently from the wild type allele. No altered alleles were found 
in the MMR-proficient control cells as expected since MI only occurs in MMR defective 
cell hosts. To confirm these data, PCRs were repeated using isolated BAT26 products. 
Primers and conditions were the same as described above except that reactions were 
5 amplified for 20 cycles. PCR products were gel-purified and cloned into T-tailed vectors 
(InVitrogen) as suggested by the manufacturer. Recombinant clones from DMA-treated 
and control cells were screened by PCR again using the BAT26 primers. Fifty bacterial 
colonies were analyzed for BAT26 structure by directly adding an aliquot of live bacteria 
to the PCR mix. PCR reactions were carried out as described above, and products were 
10 electrophoresed on 4% agarose gels and stained with ethidium bromide. As shown in 

Figure 4B, microsateUites from DMA-treated cells had alterations (asterisks) that made the 
marker length larger or smaller than the wild type allele found in control cells. 

To confirm that these differences in molecular weight were due to shifts within the 
polynucleotide repeat, a hallmark of defective MMR, five clones from each sample were 
15 sequenced using an ABI automated sequencer with an M13-R primer located in the T-tail 
vector backbone. Sequence analysis revealed that tihe control cell clone used in our studies 
was homozygous for the BAT26 allele with a 26nt polyA repeat. Cells treated with DMA 
found multiple alleles ranging from the wild-type with 26 polyA repeat to shorter alleles 
(24 polyA repeat) and larger alleles (28 polyA repeat) (Fig. 5). 
20 These data corroborate the H36pCAR data in Example 1 and Fig. 3 and 

demonstrates the ability to block MMR with a chemical in a range of hosts. 

Example 6: Chemical inhibitors of MMR generate DNA hypermutability in Plants 
and new phenotypes. 

25 To determine if chemical inhibitors of MMR work across a diverse array of 

organisms, we explored the activity of DMA on Arabidopsis thaliana (AT), a member of 
the mustard plant family, as a plant model system to study the effects of DMA on 
generating MMR deficiency, genome alterations, and new output traits. 

Briefly, AT seeds were sterilized with straight commercial bleach and 100 seeds 

30 were plated in 100mm Murashige and Skoog (MS) phytagar (Life Technology) plates with 
increasing amounts of DMA (ranging from lOOum to 50mM). A similar amount of seeds 
were plated on MS phytagar only or in MS phytagar with increasing amounts of EMS 
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(lOOfiM to 50mM) ? a mutagen commonly used to mutate AT seeds (McCallum, C.M.et al. 
(2000) Nat BiotechnoLl8:455-457). Plates were grown in a temperature-controlled, 
fluorescent-lighted humidifier (Percival Growth Chamber) for 10 days. After 10 days, 
seeds were counted to determine toxicity levels for each compound. Table 2 shows the 
number of healthy cells/treatment as determined by root formation and shoot formation. 
Plantlets that were identical to untreated seeds were scored healthy. Seeds with stunted 
root or shoot formation were scored intermediate (inter). Non-germinated seeds were 
scored dead. 

Table 2: Toxicity curve of DMA and EMS on Arabidopsis (per 100 cells) 




The data in Table 2 show that DMA toxicity occurs at lOmM of continuous 
culture, while toxicity occurs at 250 joM for EMS. Next, 50 seeds were plated in two 
150mm dishes containing 2mM DMA, 250 jiM EMS or no drug. Seeds were grown for 10 
days and then 10 plants from each plate were transferred to soil. All plants appeared to be 
similar in color and height. Plants were grown at room temperature with daily cycles of 1 S 
hr light and 6 hr dark. After 45 days seeds are harvested from siliques and stored in a 
desiccator at 4°C for 72 hours. Seeds are then sterilized and 100 seeds from each plant is 
sown directly into water-saturated soil and grown at room temperature with daily cycles of 
18 hr light and 6 hr dark. At day 10 phenotypically distinct plants were found in 7 out of 
118 DMA treated while no phenotypic difference was observed in 150 EMS-treated or 150 
control plants. These 7 altered plants were light green in color and appeared to grow 
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slower. Fig. 6 shows a typical difference between the DMA altered plant and controls. 
DMA-exposed plants produced offspring that were yellow in appearance in contrast to 
dark green, which is always found in wild-type plants. In addition, the yellow plants were 
also shorter. After 30 days, most wild-type plants produced flowers and siliques, while the 
7 mutants just began flowering. After 45 days, control plants were harvested while mutant 
plants were harvested 10 to 15 days later. No such effects were observed in 150 plantlets 
from EMS treated plants. 

The effect of DMA on MMR was confirmed by monitoring the structure of 
endogenous polynucleotide repeat markers within the plant genome. DNA was extracted 
using the DNAzol method following the manufacturer's protocol (Life Technology). 
Briefly, two leaves were harvested from DMA, EMS or untreated plants and DNA was 
extracted. DNAs were quantified by optical density using a BioRad Spectrophotometer. 
InArabidopsis, a series of poly-A (A) n , (CA)„ and (GA) n markers were found as a result of 
EMBL and GenBank database searches of DNA sequence data generated as a result of the 
Arabidopsis genome-sequencing project. Two markers that are naturally occurring, 
ATHACS and Ngal28 are used to monitor microsatellite stability using primers described 
(Bell, C.J. and J.R. Ecker (1994) Genomics 19:137-144). ATHACS has a stretch of thirty- 
six adenine repeats (A) 36 whereas Ngal28 is characterized by a di-nucleotide AG repeat 
that is repeated nineteen times (AG) 19 while the Nga280 marker contains a polyAG repeat 
20 marker with 15 dinucleotides. DMA-mediated alterations of these markers are measured 
by a PCR assay. Briefly, the genomic DNA is amplified with specific primers in PCR 
reaction buffers described above using l-10ng plant genomic DNA. Primers for each 
marker are listed below: 
nga280: 

25 nga280-F: 5 '-CTGATCTC ACGGAC AATAGTGC-3 ' (SEQ ID NO S) 

nga280-R: 5 '-GGCTCCATAAAAAGTGCACC-3 ' (SEQ ID NO:6) 

ngal28: 

ngal28-F: 5 '-GGTCTGTTGATGTCGTAAGTCG-3 ' (SEQ ED NC-7) 
30 ngal28-R: 5 '-ATCTTGAAACCTTTAGGGAGGG-3 ' (SEQ ID NO:8) 

ATHACS: 

ATHACS-F: 5 '-AGAAGTTTAGAC AGGT AC-3 ' (SEQ ID NC-9) 
ATHACS-R: 5 '-AAATGTGCAATTGCCTTC-3 ' (SEQ ED NO:10) 
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Cycling conditions are 94°C for 15 seconds, 55°C for 15 seconds and 72°C for 30 
seconds, conditions that have been demonstrated to efficiently amplify these two markers 
(personal observation, Morphotek)* PCR products are analyzed on 3.5% metaphor agarose 
gel in Tris-AcetatC-EDTA buffer following staining with ethidium bromide. 
5 Another method used to demonstrate that biochemical activity of a plant host's 

MMR is through the use of reporter genes disrupted by a polynucleotide repeat, similar to 
that described in Example 1 and Fig. 1 . Due to the high endogenous p-galactosidase 
background, we engineered a plant compatible MMR-sensitive reporter gene consisting of 
the (3-glucoronidase (GUS) gene with a mononucleotide repeat that was inserted just 

10 downstream of the initiation codon. Two reporter constructs were generated. pGUS-OF, 
contained a 20 base adenine repeat inserted just downstream of the initiating methionine 
that resulted in a frameshift, therefore producing a nonfunctional enzyme. The second, 
pGUS-IF, contained a 19 base adenine repeat that retained an open reading frame and 
served as a control for p-glucoronidase activity. Both constructs were generated by PCR 

15 using the pBI-121 vector (Life Technologies) as template. The antisense primer was 

directed to the 3' end of the Nopaline Synthase (NOS) polytennination sequence contained 
within the pBI-121 plasmid and contained a unique EcoRI restriction site to facilitate 
cloning of the vector into the pBI-121 binary vector backbone. The sense primers 
contained a unique BamHI restriction site to facilitate cloning of the chimeric GUS 

20 reporter gene into the pBI-121 binary vector backbone. The primers used to generate each 
reporter are: 

1. sense primer for pGUS-IF (uidA-ATG-polyA-IF) : 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA CGT CCT GTA GAA ACC-3' (SEQ 
25 ID NO:ll) 

2. sense primer for pGUS-OF (uidA-ATG-polyA-OF) : 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA ACG TCC TGT AGA AAC C-3' 



30 



(SEQ ID NO:12) 

3. antisense primer (Nos-term) : 

5'- CCC GAA TTC CCC GAT CTA GTA ACA TAG ATG- 3' (SEQ ID NO: 13) 



PCR amplifications were carried out using reaction buffers described above. 
35 Reactions were performed using 1 ng of pBI-121 vector as template (Life Technologies) 
and the appropriate corresponding primers. Amplifications were carried at 94°C for 30 
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seconds, 54°C for 60 seconds and 72°C for 60 seconds for 25 cycles. PCR products of the 
expected molecular weight was gel purified, cloned into T-tailed vectors (InVitrogen), and 
sequenced to ensure authentic sequence using the following primers: CaMV-FORW. [= 5'- 
gat ate tec act gac gta ag-3'] (SEQ ID NO:30) for sequencing from the CaMV promoter 
5 into the 5' end of GUS cDNAs; NOSpA-42F [= 5'-tgt tgc egg tot tgc gat g-3'] (SEQ ID 
NO:31) for sequencing of the NOS terminator; NOSpA-Cend-R [= 5'-ccc gat eta gta aca 
tag atg-3'] (SEQ ID NO:32) for sequencing from the NOS terminator into the 3' end of the 
GUS cDNAs; GUS-63F [= 5'-cag tct gga teg cga aaa ctg-3'] (SEQ ID NO:33), GUS-441F 
[= 5'-ggt gat tac cga cga aaa cg-3'] (SEQ ID NO:34), GUS-825F [= 5'-agt gaa ggg cga aca 
10 gtt cc-3'] (SEQ ID NO:35), GUS-1224F [= 5'-gag tat tgc caa cga acc-3'] (SEQ ID NO:36), 
GUS-1596F [= 5'-gta tea ccg cgt ctt tga tc-3'] (SEQ ID NO:37), GUS-265R [= 5'-cga aac 
gca gca cga tac g-3'] (SEQ ID NO:38), GUS-646R [= 5'-gtt caa cgc tga cat cac c-3'] (SEQ 
ID NO:39), GUS-1033R [= 5'-cat gtt cat ctg ccc agt cg-3'] (SEQ ID NO:40), GUS-1425R 
[= 5'-gct ttg gac ata ccatcc-3'] (SEQ ID NO:41), and GUS-1783R [= S'-cac cga agt tea tgc 
15 cag-3 '] (SEQ ID NO:42) for the sequence of the full length GUS cDNAs. No mutation 
were found in either the OF or IF version of the GUS cDNA, and the expected frames for 
both cDNAs were also confirmed. pCR-IF-GUS and pCR-OF-GUS plasmids were 
subsequently digested with the BamH I and EcoR I restriction endonucleases, to generate 
DNA fragments containing the GUS cDNA along with the NOS terminator. These 
20 fragments were ligated into the BamH I and the EcoR I sites of the pBI- 1 2 1 plasmid, 
which was prepared for cloning by cutting it with the same enzymes to release the wild 
type GUS cDNA. The resulting constructs (pBI-IF-GUS and pBI-OF-GUS) were 
subsequently digested with Hind m and EcoR I to release the DNA fragments 
encompassing the CaMV promoter, the IF or OF GUS cDNA, and the NOS terminator. 
25 Finally, these fragments were ligated into the correspondent restriction sites present in the 
pGPTV-HPT binary vector (ATCC) to obtain the pCMV-IF-GUS-HPT and pCMV-OF- 
GUS-HPT binary vectors. 

The resulting vectors, CMV-OF-GUS-HPT and CMV-IF-GUS-HPT now contain 
the CaMV35S promoter from the Cauliflower Mosaic 35 S Virus driving the GUS gene 
30 followed by a NOS terminator and polyadenylation signal (Fig. 7). In addition, this vector 
also contains a hygromycin resistance gene as a selectable marker that is used to select for 
plants containing this reporter. 
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Generation of GUS reporter-expressing Arabidopsis thaliana transgenic plants. 

Agrobacterium tumefaciens bacteria are used to shuttle binary expression vectors 
into plants. To generate f3-glucoronidase-expressing Arabidopsis thaliana (A. thaliana) 
plants, Agrobacteriui r. tumefaciens cells (strain GV3101) were electroporated with the 
5 CMV-OF-GUS-HPT or the CMV-IF-GUS-HPT binary vector using methods known by 
those skilled in the art. Briefly, one-month old A. thaliana (ecotype Columbia) plants 
were infected by immersion in a solution containing 5% sucrose, 0.05% silwet and binary 
vector-transformed Agrobacteria cells for 10 seconds. These plants were then grown at 
25°C under a 16 hour day and 8 hour dark photoperiod. After 4 weeks, seeds (referred to as 

10 Tl) were harvested and dried for 5 days. Thirty thousands seeds from ten CMV-OF-GUS- 
HPT or CMV-DF-GUS-HPT-transformed plants were sown in solid Murashige and Skoog 
(MS) media plates in the presence of 20 |ng/ml of hygromycin (HYG). Three hundred 
plants were found to be HYG resistant and represented GUS expressing plants. These 
plants along with 300 control plants were grown in MS media for two weeks and then 

15 transferred to soil. Plants were grown for an additional four weeks under standard 
conditions at which time T2 seeds were harvested. 

To confirm the integration and stability of the GUS vector in the plant genome, 
gene segregation and PCR analyses were conducted. Commonly, three out of four Tl 
plants transformed by Agrobacteria technology are expected to carry the vector inserted 

20 within a single locus and are therefore considered heterozygous for the integrated gene. 
Approximately 75% of the seeds (T2) generated from most of the Tl plants were found 
HYG-resistant and this in accordance with the expected 1 :2: 1 ratio of null (no GUS 
containing plants), heterozygous, and homozygous plants, respectively, in self-pollinating 
conditions. To confirm that these plants contained the GUS expression vector, genomic 

25 DNA was isolated from leaves of Tl plants using the DNAzol-mediated technique as 
described above. One ng of genomic DNA was analyzed by polymerase chain reaction 
(PCR) to confirm the presence of the GUS vector. PCR was carried out for 25 cycles with 
the following parameters: 95°C for 30 seconds; 54°C for 1 minute; and 72°C for 2 minutes 
using primers listed above. Positive reactions were observed in DNA from CMV-OF- 

30 GUS-HPT and CMV-IF-GUS-HPT-transformed plants and not from control (uninfected) 
plants. 
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In order to assess the expression of the GUS in Tl plants, leaf tissue was collected 
from Tl plants, homogenized in liquid nitrogen using glass pestles, and suspended in RLT 
lysing buffer (Qiagen, RNeasy plant RNA extraction kit). Five micrograms of total RNA 
was purified according to the manufacturer's suggested protocol and then loaded onto a 
5 1 .2% agarose gel (lx MOPS buffer, 3% formaldehyde), size-fractionated by 

electrophoresis, and transferred onto N-Hybond+ membrane (Amersham). Each membrane 
was incubated at 55°C in 10 ml of hybridization solution (North2South labeling kit. Pierce) 
containing 100 ng of GUS, tubulin, or HYG probes, which were generated by PCR 
amplification, according to the manufacturer's directions. Membranes were washed three 
10 times in 2x SSC, 0.1% SDS at 55°C, and three times in 2x SSC at ambient temperature. 
Detection was carried out using enhanced chemiluminescence (ECL). GUS message was 
detected in three out of ten analyzed transgenic lines, while no signal was found in the 
control plants. Collectively these studies demonstrated the generation of GUS expressing 
transgenic A. thaliana plants. 
15 To determine the status of MMR activity in host plants, one can measure for the 

production of functional P-glucoronidase by staining plant leaves or roots in situ for 0-glu 
activity. Briefly, plant tissue is washed twice with water and fixed in 4 mis of 0.02% 
glutaraldehyde for 15 minutes. Next, tissue is rinsed with water and incubated in X-glu 
solution [0. 1M NaP0 4 , 2.5 mM K 3 Fe(CN) 6 , 2.5mM K 4 Fe(CN) 6 , 1 .5 mM MgCl 2 , and 1 
20 mg/ml X-GLU (5 bromo-4-chloro-3-indoyl- p-D-glucuronide sodium salt) (Gold 

Biotechnology)] for 6 hours at 37°C. Tissues are then washed twice in phosphate buffered 
saline (PBS) solution, once in 70% ethanol and incubated for 4 hours in methanolracetone 
(3:1) for 8 hours to remove chlorophyll. Tissues are then washed twice in PBS and stored 
in PBS with 50% glycerol. Plant tissue with functional GUS activity will stain blue. 
25 The presence of GUS activity in CMV-IF-GUS-HPT plants indicates that the in- 

frame N-terminus insertion of the poly A repeat does not disrupt the GUS protein function. 
The CMV-OF-GUS-HPT plants treated with DMA, EMS or untreated are tested to 
determine if these plants produce GUS activity. The presence of GUS activity in DMA 
treated plants indicates that the polyA repeat was altered, therefore, resulting in a frame- 
restoring mutation. Agents such as EMS, which are known to damage DNA by alkylation 
cannot affect the stability of a polynucleotide repeat. This data indicates that plants are 
defective for MMR, the only process known to be responsible for MI. 
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These data demonstrate the utility and power of using a chemical inhibitor of 
MMR to generate a high degree of genetic alteration that is not capable by means of 
standard DNA damaging drugs. Moreover, this application teaches of the use of reporter 
genes such as GUS-OF in p lants to monitor for the MMR activity of a plant host. 

5 

EXAMPLE 7: Use of chemical MMR inhibitors yields microsatellite instability in 
microbes. 

To demonstrate the ability of chemical inhibitors to block MMR in a wide range of 
hosts, we employed the use of Pichia yeast containing a pGUS-OF reporter system similar 

10 to that described in Example 5. Briefly, the GUS-OF and GUS-IF gene, which contains a 
polyA repeat at the N-terminus of the protein was subcloned from the pCR-IF-GUS and 
pCR-OF-GUS plasmids into the EcoRI site of the pGP vector, which is a consitutively 
expressed yeast vector containing a zeocin resistance gene as selectable marker. pGP- 
GUS-IF and pGP-GUS-OF vectors were electroporated into competent Pichia cells using 

15 standard methods known by those skilled in the art. Cells were plated on YPD agar (1 Og/L 
yeast extract; 20 g/L peptone; 2% glucose; 1.5% bactoagar) plates containing 100 fig/ml 
zeocin. Recombinant yeast are then analyzed for GUS expression/function by replica 
plating on YPD agar plates containing 100 jxg/ml zeocin plus 1 mg/ml X-glu (5-bromo-4- 
chloro-3-indoyl-beta-D-glucuronide sodium salt) and grown at 30°C for 16 hours. On 

20 hundred percent of yeast expressing GUS-IF were found to turn blue in the presence of the 
X-glu substrate while none of the control yeast turned blue. None of the yeast containing 
the GUS-OF turned blue in the presence of the X-glu substrate under normal growth 
conditions. 

To demonstrate the ability of chemicals to block MMR in yeast, GUS-OF and 
25 control cells were incubated with 300 \\M DMA, EMS, or no chemical for 48 hours. After 
incubation, yeast were plated on YPD-ZEO-X-GLU plates and grown at 30°C for 16 hours. 
After incubation, a subset of yeast expressing GUS-OF contain blue subclones, while 
none are seen in EMS or control cells. These data demonstrate the ability of chemicals to 
block MMR of microbes in vivo to produce subclones with new output traits. 

30 
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EXAMPLE 8: Classes of other chemicals capable of blocking MMR in vivo 

The discovery of anthracene compounds presents a new method for blocking MMR 
activity of host organisms in vivoi While 9,10-dimethylanthracene (DMA) was found to 
block MMR in cell hosts, other analogs with a similar chemical composition from this 
5 class are also claimed in this invention. These include anthracene and related analogs such 
as 9,10-diphenylanthracene and 9,10-di-M-tolylanthracene. Myers et al. ((1988) Biochem. 
Biophys. Res. Commun. 151:1441-1445) disclosed that at high concentrations, DMA acts 
as a potent weak mutagen, while metabolized forms of DMA are the "active" ingredients 
in promoting mutation. This finding suggests that metabolites of anthracene-based 
10 compounds may also act as active inhibitors of MMR in vivo. For instance, metabolism of 
anthracene and 9,10-dimethylanthracene by Micrococcus sp., Pseudomonas sp. and 
Bacillus macerans microbes have found a number of anthracene and 9,10- 
dimethylanthracene metabolites are formed. These include anthracene and 9,10- 
dimethylanthracene cis-dihydrodiols, hydroxy-methyl-derivatives and various phenolic 
15 compounds. Bacteria metabolize hydrocarbons using the dioxygenase enzyme system, 
which differs from the mammalian cytochrome P-450 monoxygenase. These findings 
suggest the use of bacteria for bioti^forming anthracene and DMA for additional MMR 
blocking compounds (Traczewska, T.M. etal. (1991) Acta. Microbiol. Pol. 40:235-241). 
Metabolism studies of DMA by rat-liver microsomal preparations has found that this 
molecule is converted to 9-Hydroxymethyl-10-methylanthracene (9-OHMeMA) and 9,10- 
dihydroxymethyl-anthracene (9,10-DiOHMeA) (Lamparczyk, H.S. etal. (1984) 
Carcinogenesis 5:1405-1410). In addition, the trans-l,2-dihydro-l,2-dihydroxy derivative 
of DMA (DMA 1,2-diol) was found to be a major metabolite as determined by 
chromatographic, ultraviolet (UV), nuclear magnetic resonance (NMR), and mass spectral 
25 properties. DMA 1 ,2-diol was also created through the oxidation of DMA in an ascorbic 
acid-ferrous sulfate-EDTA system. Other dihydrodiols that are formed from DMA by 
metabolism are the trans-1,2- and 3,4-dihydrodiols of 9-OHMeMA (9-OHMeMA 1,2-diol 
and 9-OHMeMA 3,4-diol) while the further metabolism of DMA 1,2-diol can yield both of 
these dihydrodiols. Finally, when 9-OHMeMA is further metabolized, two main 
30 metabolites are formed; one was identified as 9,10-DiOHMeA and the other appeared to be 
9-OHMeMA 3,4-diol. 



20 
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The metabolism of 9-methylanthracene (9-MA), 9-hydroxymethylanthracene (9- 
OHMA), and 9,10-dimethylanthracene (9,10-DMA) by fungus also has been reported 
(Cerniglia, C.E. et ah (1990) Appl. Environ, Microbiol. 56:661-668). These compounds 
are also useful for generating DMA derivatives capable of blocking MMR. Compounds 9- 
5 MA and 9,10-DMA are metabolized by two pathways, one involving initial hydroxylation 
of the methyl group(s) and the other involving epoxidation of the 1,2- and 3,4- aromatic 
double bond positions, followed by enzymatic hydration to form hydroxymethyl trans- 
dihydrodiols. For 9-MA metabolism, the major metabolites identified are trans- 1,2- 
dihydro-l,2-dihydroxy and trans-3,4-dihydro-3,4-dihydroxy derivatives of 9-MA and 9- 

10 OHMA, whereby 9-OHMA can be further metabolized to trans- 1,2- and 3,4-dihydrodiol 
derivatives. Circular dichroism spectral analysis revealed that the major enantiomer for 
each dihydrodiol was predominantly in the S,S configuration, in contrast to the 
predominantly R,R configuration of the trans-dihydrodiol formed by mammalian enzyme 
systems. These results indicate that Caenorhabditis elegans metabolizes methylated 

15 anthracenes in a highly stereoselective manner that is different from that reported for rat 
liver microsomes. 

The analogs as listed above provide an example but are not limited to anthracene- 
derived compounds capable of eliciting MMR blockade. Additional analogs that are of 
potential use for blocking MMR are shown in Fig. 8. 

20 

Other classes of small molecular weight compounds that are capable of blocking 
MMR in vivo. 

MMR is a multi-step process that involves the formation of protein complexes that 
detect mismatched bases or altered repetitive sequences and interface these mutations with 

25 enzymes that degrade the mutant base and repair the DNA with correct nucleotides. First, 
mismatched DNA is recognized by the mutS heterodimeric complex consisting of MSH2 
and GTBP proteins. The DNA bound mutS complex is then recognized by the mutL 
heterdimeric complex that consists of PMS2 and MLH1 proteins. The mutL complex is 
thought to interface exonucleases with the mismatched DNA site, thus initiating this 

30 specialized DNA repair process. After the mismatched bases are removed, the DNA is 
repaired with a polymerase. 
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There are several steps in the normal process that can be targeted by small 
molecular weight compounds to block MMR. This application teaches of these steps and 
the types of compounds that may be used to block this process. 

5 ATPase inhibitors: 

The finding that nonhydrolyzable forms of ATP are able to suppress MMR in vitro 
also suggest that the use for this type of compound can lead to blockade of MMR in vivo 
and mutation a host organism's genome (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325- 
2331; Allen, DJ. et al (1997) EMBOJ. 16:4467-4476; Bjornson, K.P. et al (2000) 
10 Biochem. 39:3176-3183). One can use a variety of screening methods described within 
this application to identify ATP analogs that block the ATP-dependent steps of mismatch 
repair in.vivo. 



15 



Nuclease inhibitors: 

The removal of mismatched bases is a required step for effective MMR (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399). This suggests that 
compounds capable of blocking this step can lead to blockade of MMR in vivo and 
mutation a host organism's genome. One can use a variety of screening methods described 
within this application to identify nuclease inhibitors analogs that block the nuclease steps 
20 of mismatch repair in vivo. An example of the types of nuclease inhibitors are but not 
limited to analogs of N-Ethylmaleimide, an endonuclease inhibitor (Huang, Y.C., etal. 
(1995) Arch. Biochem. Biophys. 316:485), heterodimeric adenine-chain-acridine 
compounds, exonulcease m inhibitors (Belmont P, etal., BioorgMed Chem Lett (2000) 
10:293-295), as well as antibiotic compounds such as Heliquinomycin, which have 
helicase inhibitory activity (Chino, M, et.al. J. Antibiot. (Tokyo) (1998) 51:480-486). 



25 



Polymerase inhibitors : 

Short and long patch repair is a required step for effective MMR (Modrich, P. 
(1994) Science 266:1959-1960). This suggests that compounds capable of blocking 
30 MMR-associated polymerization can lead to blockade of MMR in vivo and mutation a host 
organism's genome. One can use a variety of screening methods described within this 
application to identify polymerase inhibitors analogs that block the polymerization steps of 
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mismatch repair in vivo. An example of DNA polymerase inhibitors that are useful in 
blocking MMR activity include, but are not limited to, analogs of actinomycin D (Martin, 
S.J., etal. (1990) J. Immunol 145:1859), Aphidicolin (Kuwakado, K. et.al. (1993) 
Biochem. Phannacol 46:1909) l-(2 Deoxy-2 ! -fluoro-beta-L-arabinofuraiiosyl)-5- 
5 methyluracil (L-FMAU) (Kukhanova M, et.al., Biochem Pharmacol (199S) 55:11 Si- 
ll 87), and 2 t ,3 t -dideoxyribonucleoside 5 -triphosphates (ddNTPs) (Ono, K., et.al., Biomed 
Pharmacother (1984) 38:382-389). 

Chemical Inhibitors of Mismatch Repair Gene Expression 

10 MMR is a multi-protein process that requires the cooperation of several proteins 

such as but not limited to mutS homologs, MSH2, MSH3, MSH6, GTBP; mutL homologs 
PMS1, PMS2, MLH1; and exonucleases and helicases such as MutH and MutY (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, P. (1994) 
Science 266:1959-1960). Chemicals capable of blocking the expression of these genes can 

15 lead to the blockade of MMR. An example of a chemical that is capable of blocking 

MMR gene expression is an oligodeoxynucleotide that can specifically bind and degrade 
an MMR gene message and protein production as described by Chauhan DP, etal. (Clin 
Cancer Res (2000) 6:3827-3831). One can use a variety of screening methods described 
within this application to identify inhibitors that block the expression and/or function of 

20 MMR genes in vivo. 

DISCUSSION 

The results described herein demonstrate the use of chemicals that can block 
mismatch repair of host organisms in vivo to produce genetic mutations. The results also 
demonstrate the use of reporter systems in host cells and organisms that are useful for 

25 screening chemicals capable of blocking MMR of the host organism. Moreover, the 
results demonstrate the use of chemical inhibitors to block MMR in mammalian cells, 
microbes, and plants to produce organisms with new output traits. The data presented 
herein provide novel approaches for producing genetically altered plants, microbes, and 
mammalian cells with output traits for commercial applications by inhibiting MMR with 

30 chemicals. This approach gives advantages over others that require the use of recombinant 
techniques to block MMR or to produce new output traits by expression of a foreign gene. 
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This method will be useful in producing genetically altered host organisms for 
agricultural, chemical manufacturing, pharmaceutical, and environmental applications. 



PMS2 (mouse) (SEQ ID NO: 14) 

5 MEQTEGVSTE CAKAIKPIDG KSVHQICSGQ 
YGVDLIEVSD NGCGVEEENF EGLALKHHTS 
TISTCHGSAS VGTRLVFDHN GKITQKTPYP 
YSKMVQVLQA YCIISAGVRV SCTNQLGQGK 
PFVQLPPSDA VCEEYGLSTS GRHKTFSTFR 

10 RSLSLSMRFY HMYNRHQYPF WLNVSVDSE 
GMFDSDANKL NVNQQPLLDV EGNLVKLHTA 
LREAFSLHPT KEIKSRGPET AELTRSFPSE 
SPGDCMDREK IEKDSGLSST SAGSEEEFST 
DCRPPGTGQS LKPEDHGYQC KALPLARLSP 

15 EVDVAIKMNK RIVLLEFSLS SLAKRMKQLQ 
LRKEISKSMF AEMEILGQFN LGFIVTKLKE 
ITPQTLNLTA VNEAVLIENL EIFRKNGFDF 
DELIFMLSDS PGVMCRPSRV RQMFASRACR 
PHGRPTMRHV ANLDVISQN 

20 

PMS2 (mouse cDNA) (SEQ ID NO: 15) 



VILSLSTAVK ELIENSVDAG ATTIDLRLKD 60 
KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
RPKGTTVSVQ HLFYTLPVRY KEFQRNIKKE 180 
RHAWCTSGT SGMKENI GSV FGQKQLQSLI 24 0 
ASFHSARTAP GGVQQTGSFS SSIRGPVTQQ 300 
CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360 
ELEKPVPGKQ DNSPSLKSTA DEKRVASISR 4 20 
KRGVLSSYPS DVISYRGLRG SQDKLVSPTD 4 80 
PEVASSFSSD YNVSSLEDRP SQETINCGDL 540 
TNAKRFKTEE RPSNVNISQR LPGPQSTSAA 600 
HLKAQNKHEL SYRKFRAKIC PGENQAAEDE 660 
DLFLVDQHAA DEKYNFEMLQ QHTVLQAQRL 720 
VIDE DAP VTE RAKLISLPTS KNWTFGPQDI 780 
KSVMIGTALN ASEMKKLITH MGEMDHPWNC 84 0 

859 



gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60 

f a f^? tCg tca ^ taac 9 atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120 
gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180 
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240 
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300 
tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360 
an aa ^tatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 4 20 
actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 4 80 
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540 
atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600 
ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660 
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 7 20 
3D aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780 
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 
tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 



840 

900 
960 



cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 10?0 
#U cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc " ~~ : ' 



1080 
1140 
1200 



agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag i.uu 
ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1°60 
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctaq 1320 
<K> atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 138 0 
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 1440 
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccaq 1500 
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560 
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620 
DU cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680 
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740 
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtgqtq 1800 
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 18 60 
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaaqacaq 1920 
53 aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcaq 1980 
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 204 0 
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100 
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaaq ?160 
^ f^gaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcaqt 2220 
clan* ^ g ^ aaCtga "Wgacct Sttlltggtg gacl a gca?g 2280 

ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340 
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10 



ggctcatcac 
atctggaaat 
ctgaaagggc 
atatagatga 
gagtcagaca 
tcaatgcgag 
actgccccca 
actgacacac 
ttttaagtaa 
catttaaaag 
tgatccggtg 
agactcaatt 



accccagact 
attcagaaag 
taaattgatt 
actgatcttt 
gatgtttgct 
cgagatgaag 
cggcaggcca 
cccttgtagc 
tctgattatc 
cagtgttaag 
ggagctcatg 
caaggacaaa 



ctgaacttaa 
aatggctttg 
tccttaccaa 
atgttaagtg 
tccagagcct 
aagctcatca 
accatgaggc 
atagagttta 
gttgtacaaa 
gcaggcatga 
tgagcccagg 
aaaaaaaaga 



ctgctgtcaa 
actttgtcat 
ctagtaaaaa 
acagccctgg 
gtcggaagtc 
cccacatggg 
acgttgccaa 
ttacagattg 
aattagcatg 
tggagtgttc 
actttgagac 
tatttttgaa 



tgaagctgta 
tgatgaggat 
ctggaccttt 
ggtcatgtgc 
agtgatgatt 
tgagatggac 
tctggatgtc 
ttcggtttgc 
ctgctttaat 
ctctagctca 
cactccgagc 
gccttttaaa 



ctgatagaaa 
gctccagtca 
ggaccccaag 
cggccctcac 
ggaacggcgc 
cacccctgga 
atctctcaga 
aaagagaagg 
gtactggatc 
gctacttggg 
cacattcatg 
aaaaaa 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3056 



15 



20 



25 



30 
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40 
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55 



60 



PMS2 (human) (SEQ ID NO: 16) 

MERAESSSTE PAKAIKPIDR KSVHQICSGQ 
YGVDLIEVSD NGCGVEEENF EGLTLKHHTS 
TISTCHASAK VGTRLMFDHN GKIIQKTPYP 
YAKMVQVLHA YCIISAGIRV SCTNQLGQGK 
PFVQLPPSDS VCEEYGLSCS DALHNLFYIS 
KVCRLVNEVY HMYNRHQYPF WLNISVDSE 
GMFDSDVNKL NVSQQPLLDV EGNLIKMHAA 
REAFSLRHTT ENKPHSPKTP EPRRSPLGQK 
PSDPTDRAEV EKDSGHGSTS VDSEGFSIPD 
ETDDSFSDVD CHSNQEDTGC KFRVLPQPTN 
SASQVDVAVK INKKWPLDF SMSSLAKRIK 
EDELRKEISK TMFAEMEIIG QFNLGFIITK 
QRLIAPQTLN LTAVNEAVLI ENLEIFRKNG 
QDVDELIFML SDSPGVMCRP SRVKQMFASR 
WNCPHGRPTM RHIANLGVIS QN 

PMS2 (human cDNA) (SEQ ID NO: 17) 



cgaggcggat 
aaggccatca 
ctgagtctaa 
aatattgatc 
tgtggggtag 
caagagtttg 
tcactttgtg 
actcgactga 
agagggacca 
tttcaaagga 
atcatttcag 
cctgtggtat 
cagaagcagt 
gaagagtacg 
atttcacaat 
aaccggcggc 
tataatcgac 
gatatcaatg 
gcagttttaa 
agtcagcagc 
gaaaagccca 
aaagacgtgt 
aagcctcaca 
atgctgtctt 
gaggcagtga 
gactcggggc 
agtcactgca 
gtggactctc 
tcaaaccagg 
accccaaaca 
aagttagtaa 
aagaaagttg 
cat'catgaag 



cgggtgttgc 
aacctattga 
gcactgcggt 
taaagcttaa 
aagaagaaaa 
ccgacctaac 
cactgagcga 
tgtttgatca 
cagtcagcgt 
atattaagaa 
caggcatccg 
gcacaggtgg 
tgcaaagcct 
gtttgagctg 
gcacgcatgg 
cttgtgaccc 
accagtatcc" 
ttactccaga 
agacctcttt 
cactgctgga 
tggtagaaaa 
ccatttccag 
gcccaaagac 
ctagcacttc 
gttccagtca 
acggcagcac 
gcagcgagta 
aggagaaagc 
aagataccgg 
caaagcgttt 
atactcagga 
tgcccctgga 
cacagcaaag 



atccatggag 
tcggaagtca 
aaaggagtta 
ggactatgga 
cttcgaaggc 
tcaggttgaa 
tgtcaccatt 
caatgggaaa 
gcagcagtta 
ggagtatgcc 
tgtaagttgc 
aagccccagc 
cattcctttt 
ttcggatgct 
agttggaagg 
agcaaaggtc 
atttgttgtt 
taaaaggcaa 
gataggaatg 
tgttgaaggt 
gcaggatcaa 
actgcgagag 
tccagaacca 
aggtgccatc 
cggacccagt 
ttccgtggat 
tgcggccagc 
gcctgaaact 
atgtaaattt 
taaaaaagaa 
catgtcagcc 
cttttctatg 
tgaaggggaa 



WLSLSTAVK 
KIQEFADLTQ 
RPRGTTVSVQ 
RQPWCTGGS 
GFISQCTHGV 
CVDINVTPDK 
DLEKPMVEKQ 
RGMLSSSTSG 
TGSHCSSEYA 
LATPNTKRFK 
QLHHEAQQSE 
LNEDIFIVDQ 
FDFVIDENAP 
ACRKSVMIGT 



cgagctgaga 
gtccatcaga 
gtagaaaaca 
gtggatctta 
ttaactctga 
acttttggct 
tctacctgcc 
attatccaga 
ttttccacac 
aaaatggtcc 
accaatcagc 
ataaaggaaa 
gttcagctgc 
ctgcataatc 
agttcaacag 
tgcagactcg 
cttaacattt 
attttgctac 
tttgatagtg 
aacttaataa 
tccccttcat 
gccttttctc 
agaaggagcc 
tctgacaaag 
gaccctacgg 
tctgaggggt 
tccccagggg 
gacgactctt 
cgagttttgc 
gaaattcttt 
tctcaggttg 
agttctttag 
cagaattaca 



ELVENSLDAG 
VETFGFRGEA 
QLFSTLPVRH 
PSIKENIGSV 
GRSSTDRQFF 
RQILLQEEKL 
DQSPSLRTGE 
AISDKGVLRP 
ASSPGDRGSQ 
KEEILSSSDI 
GEQNYRKFRA 
HATDEKYNFE 
VTERAKLISL 
ALNTSEMKKL 



gctcgagtac 
tttgctctgg 
gtctggatgc 
ttgaagtttc 
aacatcacac 
ttcgggggga 
acgcatcggc 
aaacccccta 
tacctgtgcg 
aggtcttaca 
ttggacaagg 
atatcggctc 
cccctagtga 
ttttttacat 
acagacagtt 
tgaatgaggt 
ctgttgattc 
aagaggaaaa 
atgtcaacaa 
aaatgcatgc 
taaggactgg 
ttcgtcacac 
ctctaggaca 
gcgtcctgag 
acagagcgga 
tcagcatccc 
acaggggctc 
tttcagatgt 
ctcagccaac 
ccagttctga 
atgtagctgt 
ctaaacgaat 
ggaagtttag 



ATNIDLKLKD 
LSSLCALSDV 
KEFQRNIKKE 
FGQKQLQSLI 
FINRRPCDPA 
LLAVLKTSLI 
EKKDVSISRL 
QKEAVSSSHG 
EHVDSQEKAP 
CQKLVNTQDM 
KICPGENQAA 
MLQQHTVLQG 
PTSKNWTFGP 
ITHMGEMDHP 



agaacctgct 
gcaggtggta 
tggtgccact 
agacaatgga 
atctaagatt 
agctctgagc 
gaaggttgga 
cccccgcccc 
ccataaggaa 
tgcatactgt 
aaaacgacag 
tgtgtttggg 
ctccgtgtgt 
ctcaggtttc 
tttctttatc 
ctaccacatg 
agaatgcgtt 
gcttttgttg 
gctaaatgtc 
agcggatttg 
agaagaaaaa 
aacagagaac 
gaaaaggggt 
acctcagaaa 
ggtggagaag 
agacacgggc 
gcaggaacat 
ggactgccat 
taatctcgca 
catttgtcaa 
gaaaattaat 
aaagcagtta 
ggcaaagatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

862 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



tgtcctggag 
tttgcagaaa 
gaggatatct 
cagcagcaca 
gctgttaatg 
tttgttatcg 
a^taaaaact 
agccctgggg 
cggaagtcgg 
cacatggggg 
atcgccaacc 
tttatcgcag 
atgaaacctg 
cttttcaaac 



aaaatcaagc 
tggaaatcat 
tcatagtgga 
ccgtgctcca 
aagctgttct 
atgaaaatgc 
ggaccttcgg 
tcatgtgccg 
tgatgattgg 
agatggacca 
tgggtgtcat 
atttttatgt 
ctacttaaaa 
c 



agccgaagat 
tggtcagttt 
ccagcatgcc 
ggggcagagg 
gatagaaaat 
tccagtcact 
accccaggac 
gccttcccga 
gactgctctt 
cccctggaac 
ttctcagaac 
tttgaaagac 
aaaatacaca 



gaactaagaa 
aacctgggat 
acggacgaga 
ctcatagcac 
ctggaaatat 
gaaagggcta 
gtcgatgaac 
gtcaagcaga 
aacacaagcg 
tgtccccatg 
tgaccgtagt 
agagtcttca 
tcacacccat 



aagagataag 
ttataataac 
agtataactt 
ctcagactct 
ttagaaagaa 
aactgatttc 
tgatcttcat 
tgtttgcctc 
agatgaagaa 
gaaggccaac 
cactgtatgg 
ctaacctttt 
ttaaaagtga 



taaaacgatg 
caaactgaat 
cgagatgctg 
caacttaact 
tggctttgat 
cttgccaact 
gctgagcgac 
cagagcctgc 
actgatcacc 
catgagacac 
aataattggt 
ttgttttaaa 
tcttgagaac 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2771 



PMS 1 (human) (SEQ ID NO: 1 8) 

MKQLPAATVR LLSSSQIITS WSWKELIE NSLDAGATSV 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI 
YVLDGSGHIL SQKPSHLGQG TTVTALRLFK NLPVRKQFYS 
ILKPDLRIVF VHNKAVIWQK SRVSDHKMAL MSVLGTAVMN 
PKCDADHSFT SLSTPERSFI FINSRPVHQK DILKLIRHHY 
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL 
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG 
CSSEISNIDK NTKNAFQDIS MSNVSWENSQ TEYSKTCFIS 
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP 
LNEDSCNKKS NVIDNKSGKV TAYDLLSNRV IKKPMSASAL 
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE 
NLAQKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP 
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR 
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI 
- CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE 
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 

PMS1 (human) (SEQ ID NO: 19) 

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc 
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc 
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga 
atgctggtgc cacaagcgta gatgttaaac tggagaacta 
tgcgagataa cggggagggt atcaaggctg ttgatgcacc 
acacctcaaa aataaatagt catgaagatc ttgaaaattt 
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt 
ctgataattt tagcacccag tatgttttag atggcagtgg 
cttcacatct tggtcaaggt acaactgtaa ctgctttaag 
taagaaagca gttttactca actgcaaaaa aatgtaaaga 
atctcctcat gagctttggt atccttaaac ctgacttaag 
aggcagttat ttggcagaaa agcagagtat cagatcacaa 
tggggactgc tgttatgaac aatatggaat cctttcagta 
tttatctcag tggatttctt ccaaagtgtg atgcagacca 
caccagaaag aagtttcatc ttcataaaca gtcgaccagt 
agttaatccg acatcattac aatctgaaat gcctaaagga 
ttttctttct gaaaatcgat gttcctacag ctgatgttga 
aaagccaagt attattacaa aataaggaat ctgttttaat 
cgacttgtta tggaccatta cctagtacaa attcttatga 
ccgcagctga catcgttctt agtaaaacag cagaaacaga 
aatcatctgg aaagaattat tcaaatgttg atacttcagt 
tgcataatga tgaatctgga aaaaacactg atgattgttt 
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa 
atgcatttca ggacatttca atgagtaatg tatcatggga 
gtaaaacttg ttttataagt tccgttaagc acacccagtc 
atatagatga gagtggggaa aatgaggaag aagcaggtct 
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc 
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa 
caatccctga acaaatgaat cttaatgaag attcatgtaa 
ataataaatc tggaaaagtt acagcttatg atttacttag 



DVKLENYGFD 
CCIAEVLITT 
TAKKCKDEIK 
NMESFQYHSE 
NLKCLKESTR 
PSTNSYENNK 
KNTDDCLNHQ 
SVKHTQSENG 
EKSLPCKVSN 
FVQDHRPQFL 
QESQMSLKDG 
FSMKNLKINF 
LLENHKLPAE 
KLIPGVSITE 
AVRLSRQLPM 



KIEVRDNGEG 
RTAADNFSTQ 
KIQDLLMSFG 
ESQIYLSGFL 
LYPVFFLKID 
TDVSAADIVL 
ISIGDFGYGH 
NKDHIDESGE 
NNYPIPEQMN 
IENPKTSLED 
RKKIKPTSAW 
KKQNKVDLEE 
PLEKPIMLTE 
NYLEIEGMAN 
YLSKEDIQDI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
932 



tgcctcgcgc 
aacagttcga 
gcttattgaa 
tggatttgat 
tgtaatggca 
gacaacttac 
aattacaaca 
ccacatactt 
attatttaag 
tgaaataaaa 
gattgtcttt 
gatggctctc 
ccactctgaa 
ctctttcact 
acatcaaaaa 
atctactcgt 
tgtaaattta 
tgctcttgaa 
aaataataaa 
tgtgcttttt 
cattccattc 
aaatcaccag 
cattgataaa 
gaactctcag 
agaaaatggc 
tgaaaactct 
agtgggagag 
agtaagtaat 
caaaaaatca 
caatcgagta 



tagcagcaag 

ctcctttcaa 

aactccttgg 

aaaattgagg 

atgaagtact 

ggttttcgtg 

agaacggctg 

tctcagaaac 

aatctacctg 

aagatccaag 

gtacataaca 

atgtcagttc 

gaatctcaga 

agtctttcaa 

gatatcttaa 

ttgtatcctg 

acaccagata 

aatctgatga 

acagatgttt 

aataaagtgg 

caaaatgata 

ataagtattg 

aacactaaga 

acggaatata 

aataaagacc 

tcggaaattt 

aatattgaac 

aataattatc 

aatgtaatag 

atcaagaaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 18 60 

ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920 

aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980 

aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 204 0 

5 taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100 

atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160 

ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2220' 

acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2280 

atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2340 

10 aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 24 00 

agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 24 60 

aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2520 

cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2580 

aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2640 

15 ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 27 00 

taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 27 60 

aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2820 

agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2880 

taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2940 

20 tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3000 

ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060 
aac 3063 



25 MSH2 (human) (SEQ ID NO:20) 

MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA LLAAREVFKT 60 

QGVIKYMGPA GAKNLQSWL SKMNFESFVK DLLLVRQYRV EVYKNRAGNK ASKENDWYLA 120 

YKASPGNLSQ FEDILFGNND MSASIGWGV KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD 180 

NDQFSNLEAL LIQIGPKECV LPGGETAGDM GKLRQIIQRG GILITERKKA DFSTKDIYQD 24 0 

30 LNRLLKG KKG EQMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM 300 

KLDIAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQWIKQPL MDKNRIEERL 360 

NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAN LQDCYRLYQG INQLPNVIQA 420 

LEKHEGKHQK LLLAVFVTPL TDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE 4 80 

LREIMNDLEK KMQSTLISAA RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS 54 0 

35 TVDIQKNGVK FTNSKLTSLN EEYTKNKTEY EEAQDAIVKE IVNISSGYVE PMQTLNDVLA 600 

QLDAWSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF I PNDVYFEKD 660 

KQMFHIITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSQLK 720 

GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWAISEYI ATKIGAFCMF 780 

ATHFHELTAL ANQIPTVNNL HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPKHV 84 0 

40 IECAKQKALE LEEFQYIGES QGYDIMEPAA KKCYLEREQG EKIIQEFLSK VKQMPFTEMS 900 

EENITIKLKQ LKAEVIAKNN SFVNEIISRI KVTT 934 



MSH2 (human cDNA) (SEQ ID NO:21) 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 

45 gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 

gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 

accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 24 0 

tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300 

ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360 

50 atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 4 20 

atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 4 80 

acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 54 0 

agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 

tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 

55 aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 

aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 

atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 84 0 

tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 

aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 

60 agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 

aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 

gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 

agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 
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15 



20 



25 



30 



40 



45 



55 



60 



aagatttact 

cagcaaactt 

tacaggctct 

ctcctcttac 

tagatatgga 

tcagtgaatt 

gtgcagccag 

agtttggata 

actttagtac 

ctttaaatga 

ttaaagaaat 

tgttagctca 

catatgtacg 

ggcatgcttg 

aaaaagataa 

atattcgaca 

agtcagcaga 

aattgaaagg 

ctgcaaccaa 

atggatttgg 

gcatgtttgc 

ataatctaca 

agaaaggtgt 

agcatgtaat 

gagaatcgca 

agcaaggtga 

aaatgtcaga 

agaataatag 

cagtaatgga 

atattaaccc 

atatttagta 

gctgtaactg 

ataaataaaa 



tcgtcgattc 

acaagattgt 

ggaaaaacat 

tgatcttcgt 

tcaggtggaa 

aagagaaata 

agatcttggc 

ttactttcgt 

tgtagatatc 

agagtatacc 

tgtcaatatt 

gctagatgct 

accagccatt 

tgttgaagtt 

acagatgttc 

aactggggtg 

agtgtccatt 

agtctccacg 

agattcatta 

gttagcatgg 

aacccatttt 

tgtcacagca 

ctgtgatcaa 

agagtgtgct 

aggatatgat 

aaaaattatt 

agaaaacatc 

ctttgtaaat 

atgaaggtaa 

tttttccata 

atattttact 

aggactgttt 

tcatgtagtt 



ccagatctta 

taccgactct 

gaaggaaaac 

tctgacttct 

aaccatgaat 

atgaatgact 

ttggaccctg 

gtaacctgta 

cagaagaatg 

aaaaataaaa 

tcttcaggct 

gttgtcagct 

ttggagaaag 

caagatgaaa 

cacatcatta 

atagtactca 

gtggactgca 

ttcatggctg 

ataatcatag 

gctatatcag 

catgaactta 

ctcaccactg 

agttttggga 

aaacagaaag 

atcatggaac 

caggagttcc 

acaataaagt 

gaaatcattt 

tattgataag 

gtgttaactg 

ttgaggacat 

gcaattgaca 

tgtgg 



accgacttgc 

atcagggtat 

accagaaatt 

ccaagtttca 

tccttgtaaa 

tggaaaagaa 

gcaaacagat 

aggaagaaaa 

gtgr.taaatt 

cagaatatga 

atgtagaacc 

ttgctcacgt 

gacaaggaag 

ttgcatttat 

ctggccccaa 

tggcccaaat 

tcttagcccg 

aaatgttgga 

atgaattggg 

aatacattgc 

ctgccttggc 

aagagacctt 

ttcatgttgc 

ccctggaact 

cagcagcaaa 

tgtccaaggt 

taaaacagct 

cacgaataaa 

ctattgtctg 

tcagtgccca 

tttcaaagat 

taggcaataa 



caagaagttt 

aaatcaacta 

attgttggca 

ggaaatgata 

accttcattt 

gatgcagtca 

taaactggat 

agtccttcgt 

taccaacagc 

agaagcccag 

aatgcagaca 

gtcaaatgga 

aattatatta 

tcctaatgac 

tatgggaggt 

tgggtgtttt 

agtaggggct 

aactgcttct 

aagaggaact 

aacaaagatt 

caatcagata 

aactatgctt 

agagcttgct 

tgaggagttt 

gaagtgctat 

gaaacaaatg 

aaaagctgaa 

agttactacg 

taatagtttt 

tgggctatca 

ttttattttg 

taagtgatgt 



caaagacaag 

cctaatgtta 

gtttttgtga 

gaaacaactt 

gatcctaatc 

acattaataa 

tccagtgcac 

aacaataaaa 

aaattgactt 

gatgccattg 

ctcaatgatg 

gcacctgttc 

aaagcatcca 

gtatactttg 

aaatcaacat 

gtgccatgtg 

ggtgacagtc 

atcctcaggt 

tctacctacg 

ggtgcttttt 

ccaactgtta 

tatcaggtga 

aatttcccta 

cagtatattg 

ctggaaagag 

ccctttactg 

gtaatagcaa 

tgaaaaatcc 

atattgtttt 

acttaataag 

aaaaatgaga 

gctgaatttt 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3145 



35 MLH1 (human) (SEQ ID NO:22) 



MSFVAGVIRR 
IQDNGTGIRK 
DGKCAYRASY 
GRYSVHNAGI 
KMNGYISNAN 
QNVDVNVHPT 
KSTTSLTSSS 
SGRARQQDEE 
MVEDDSRKEM 
AQHQTKLYLL 
DGPKEGLAEY 
ATEVNWDEEK 
YKALRSHILP 



LDETWNRIA 
EDLDIVCERF 
SDGKLKAPPK 
SFSVKKQGET 
YSVKKCIFLL 
KHEVHFLHEE 
TSGSSDKVYA 
MLELPAPAEV 
TAACTPRRRI 
NTTKLSEELF 
IVEFLKKKAE 
ECFESLSKEC 
PKHFTEDGNI 



AGEVIQRPAN 
TTSKLQSFED 
PCAGNQG TQ I 
VADVRTLPNA 
FINHRLVEST 
SILERVQQHI 
HQMVRTDSRE 
AAKNQSLEGD 
INLTSVLSLQ 
YQILIYDFAN 
MLADYFSLEI 
AMFYSIRKQY 
LQLANLPDLY 



AIKEMIENCL 
LASISTYGFR 
TVEDLFYNIA 
STVDNIRSIF 
SLRKAIETVY 
ESKLLGSNSS 
QKLDAFLQPL 
TTKGTSEMSE 
EEINEQGHEV 
FGVLRLSEPA 
DEEGNLIGLP 
ISEESTLSGQ 
KVFERC 



DAKSTSIQVI 
GEALASISHV 
TRRKALKNPS 
GNAVSRELIE 
AAYLPKNTHP 
RMYFTQTLLP 
SKPLSSQPQA 
KRGPTSSNPR 
LREMLHNHSF 
PLFDLAMLAL 
LLIDNYVPPL 
QSEVPGSIPN 



VKEGGLKLIQ 
AHVTITTKTA 
EEYGKILEW 
IGCEDKTLAF 
FLYLSLEISP 
GLAGPSGEMV 
IVTEDKTDIS 
KRHREDSDVE 
VGCVNPQWAL 
DSPESGWTEE 
EGLPIFILRL 
SWKWTVEHIV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
756 



50 MLH1 (human) (SEQ ID NO:23) 



cttggctctt 
acagtggtga 
gagatgattg 
ggaggcctga 
gatattgtat 
atttctacct 
actattacaa 
aaactgaaag 
gacctttttt 
gggaaaattt 
gttaaaaaac 
gacaatattc 
gaggataaaa 



ctggcgccaa 
accgcatcgc 
agaactgttt 
agttgattca 
gtgaaaggtt 
atggctttcg 
cgaaaacagc 
cccctcctaa 
acaacatagc 
tggaagttgt 
aaggagagac 
gctccatctt 
ccctagcctt 



aatgtcgttc 
ggcgggggaa 
agatgcaaaa 
gatccaagac 
cactactagt 
aggtgaggct 
tgatggaaag 
accatgtgct 
cacgaggaga 
tggcaggtat 
agtagctgat 
tggaaatgct 
caaaatgaat 



gtggcagggg 
gttatccagc 
tccacaagta 
aatggcaccg 
aaactgcagt 
ttggccagca 
tgtgcataca 
ggcaatcaag 
aaagctttaa 
tcagtacaca 
gttaggacac 
gttagtcgag 
ggttacatat 



ttattcggcg 
ggccagctaa 
ttcaagtgat 
ggatcaggaa 
cctttgagga 
taagccatgt 
gagcaagtta 
ggacccagat 
aaaatccaag 
atgcaggcat 
tacccaatgc 
aactgataga 
ccaatgcaaa 



gctggacgag 
tgctatcaaa 
tgttaaagag 
agaagatctg 
tttagccagt 
ggctcatgtt 
ctcagatgga 
cacggtggag 
tgaagaatat 
tagtttctca 
ctcaaccgtg 
aattggatgt 
ctactcagtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

7 80 
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aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 
5 ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140 
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 

10 ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 14 40 
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500 
tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 
aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620 

15 gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680 
aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740 
ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800 
gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 18 60 
tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920 

20 gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980 
cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040 
gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100 
gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160 
tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220 

25 ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280 
gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 234 0 
cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 24 00 
cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 24 60 
aataaataga tgtgtcttaa cata 24 84 

30 

hPMS2-134 (human) (SEQ ID NO:24) 

MERAESSSTE PAKAIKPIDR KSVHQICSGQ WLSLSTAVK ELVENSLDAG ATOIDLKLKD 60 
YGVDLIEVSD NGCGVEEENF EGLTLKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
TISTCHASAK VGT 133 

35 

KPMS2-134 (human cDNA) (SEQ ID NO;25) 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
40 aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
acttga 426 

45 

GTBP (human) (SEQ ID NO:26) 

MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60 
ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120 
REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180 

50 EALNKDKIKR LEI^AVCDEPS EPEEEEEMEV GTTYVTDKSE EDNEIESEEE VQPKTQGSRR 240 
SSRQIKKRRV ISDSESDIGG SDVEFKPDTK EEGSSDEISS GVGDSESEGL NSPVKVARKR 300 
KRMVTGNGSL KRKSSRKETP SATKQATSIS SETKNTLRAF SAPQNSESQA HVSGGGDDSS 3 60 
RPTVWYHETL EWLKEEKRRD EHRRRPDHPD FDASTLYVPE DFLNSCTPGM RKWWQIKSQN 4 20 
FDLVICYKVG KFYELYHMDA LIGVSELGLV FMKGNWAHSG FPEIAFGRYS DSLVQKGYKV 4 80 

55 ARVEQTETPE MMEARCRKMA HISKYDRWR REICRIITKG TQTYSVLEGD PSENYSKYLL 540 
SLKEKEEDSS GHTRAYGVCF VDTSLGKFFI GQFSDDRHCS RFRTLVAHYP PVQVLFEKGN 600 
LSKETKTILK SSLSCSLQEG LIPGSQFWDA SKTLRTLLEE EYFREKLSDG IGVMLPQVLK 660 
GMTSESDSIG LTPGEKSELA LSALGGCVFY LKKCLIDQEL LSMANFEEYI PLDSDTVSTT 7 20 
RSGAI FT KAY QRMVLDAVTL NNLEIFLNGT NGSTEGTLLE RVDTCHTPFG KRLLKQWLCA 780 

60 PLCNHYAIND RLDAIEDLMV VPDKISEWE LLPOCLPDLER LLSKIHNVGS PLKSQNHPDS 84 0 
RAIMYEETTY SKKKIIDFLS ALEGFKVMCK IIGIMEEVAD GFKSKILKQV ISLQTKNPEG 900 
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RFPDLTVELN 
RIGCRTIVYW 
AEERRDVSLK 
LPEDTPPFLE 
STLMRQAGLL 
LMHATAHSLV 
AVRLGHMACM 
REFEKMNQSL 



RWDTAFDHEK 
GIGRNRYQLE 
DCMRRLFYNF 
LKGSRHPCIT 
AVMAQMGCYV 
LVDELGRGTA 
VENECEDPSQ 
RLFREVCLAS 



ARKTGLITPK 
IPENFTTRNL 
DKNYKDWQSA 
KTFFGDDFIP 
PAEVCRLTPI 
TFDGTAIANA 
ETITFLYKFI 
ERSTVDAEAV 



AGFDSDYDQA 
PEEYELKSTK 
VECIAVLDVL 
NDILIGCEEE 
DRVFTRLGAS 
WKELAETIK 
KGACPK^YGF 
HKLLTLIKEL 



LADIRENEQS 
KGCKRYWTKT 
LCLANYSRGG 
EQENGKAYCV 
DRIMSGESTF 
CRTLFSTHYH 
NAARLANLPE 



LLEYLEKQRN 960 

IEKKLANLIM 1020 

DGPMCRPVIL 103 0 

LVTGPNMGGK 114 0 

FVELSETASI 1200 

SLVEDYSQNV 1260 

EVIQKGHRKA 1320 
1360 



10 GTBP (human cDNA) (SEQ ID NO:27) 



gccgcgcggt 
ggctgtcggt 
gagtgatgcc 
ccccggggcc 
15 caggcccttg 
gagatcggta 
ggccaagatg 
aacattcatc 
cccaacaagg 
20 ggaagcccag 
acgtgcagat 
tgagccctca 
taagagtgaa 
atctaggcga 
25 cattggtggc 
aataagcagt 
tcgaaagcgg 
ggaaacgccc 
gagagctttc 
30 tgacagtagt 
gagaagagat 
tgtgcctgag 
gtctcagaac 
catggatgct 
35 ccattctggc 
ctataaagta 
aaagatggca 
taccaagggt 
gtatcttctt 
40 tgtgtgcttt 
ccattgttcg 
aaaaggaaat 
tcaggaaggt 
ccttgaggaa 
45 ggtgcttaaa 
tgaattggcc 
tcaggagctt 
cagcactaca 
agtgacatta 
50 cctactagag 
gctttgtgcc 
cctcatggtt 
tcttgagagg 
cccagacagc 
55 ttttctttct 
agttgctgat 
tcctgaaggt 
ccatgaaaag 
tgaccaagct 
60 acagcgcaac 
ccagctggaa 
atctaccaag 
tctcataaat 
ctataacttt 



agatgcggtg 
atgtcgcgac 
aacaaggcct 
tctccttccc 
gcgcgctccg 
gcgcctgctg 

gagggttacc 

cgcgagaaag 
ggctgggtta 
aagggaggtc 
gaagccttaa 
gagccagaag 
gaagataatg 
agtagccgcc 
tctgatgtgg 
ggagtggggg 
aagagaatgg 
tcagccacca 
tctgcccctc 
cgccctactg 
gagcacagga 
gatttcctca 
tttgatcttg 
cttattggag 
tttcctgaaa 
gcacgagtgg 
catatatcca 
acacagactt 
agcctcaaag 
gttgatactt 
agatttagga 
ctctcaaagg 
ctgatacccg 
gaatatttta 
ggtatgactt 
ctctctgctc 
ttatcaatgg 
agatctggtg 
aacaacttgg 

agggttgata 

ccactctgta 
gtgcctgaca 
ctactcagta 
agggctataa 
gctctggaag 
ggttttaagt 
cgttttcctg 
gctcgaaaga 
cttgctgaca 
agaattggct 
attcctgaga 
aagggctgta 
gctgaagaac 
gataaaaatt 



cttttaggag 
agagcaccct 
cggccagggc 
caggcgggga 
cgtcaccgcc 
cccccaccag 
cctggtggcc 
ggaaatcagt 
gcaaaaggct 
atttttacag 
ataaagacaa 
aggaagaaga 
aaattgagag 
aaataaaaaa 
aatttaagcc 
atagtgagag 
tgactggaaa 
aacaagcaac 
aaaattctga 
tttggtatca 
ggaggcctga 
attcttgtac 
tcatctgtta 
tcagtgaact 
ttgcatttgg 
aacagactga 
agtatgatag 
acagtgtgct 
aaaaagagga 
cactgggaaa 
ctctagtggc 
aaactaaaac 
gctcccagtt 
gggaaaagct 
cagagtctga 
taggtggttg 
ctaattttga 
ctatcttcac 
agatttttct 
cttgccatac 
accattatgc 
aaatctccga 
aaattcataa 
tgtatgaaga 
gattcaaagt 
ctaaaatcct 
atttgactgt 
ctggacttat 
taagagaaaa 
gtaggaccat 
atttcaccac 
aacgatactg 
ggagggatgt 
acaaggactg 



ctccgtccga 
gtacagcttc 
ctcacgcgaa 
tgcggcctgg 
caaggcgaag 
ttgtgacttc 
ttgtctggtt 
ccgtgttcat 
tttaaagcca 
tgcaaagcct 
gattaagagg 
gatggaggta 
tgaagaggaa 
acgaagggtc 
agacactaag 
tgaaggcctg 
tggctctctt 
tagcatttca 
atcccaagcc 
tgaaacttta 
tcaccccgat 
tcctgggatg 
caaggtgggg 
ggggctggta 
ccgttattca 
gactccagaa 
agtggtgagg 
ggaaggtgat 
agattcttct 
gtttttcata 
acactatccc 
aattctaaag 
ttgggatgca 
aagtgatggc 
ttccattggg 
tgtcttctac 
agaatatatt 
caaagcctat 
gaatggaaca 
tccttttggt 
tattaatgat 
agttgtagag 
tgttgggtct 
aactacatac 
aatgtgtaaa 
taagcaggtc 
agaattgaac 
tactcccaaa 
tgaacagagc 
agtctattgg 
tcgcaatttg 
gaccaaaact 
atcattgaag 
gcagtctgct 



cagaacggtt 
ttccccaagt 
ggcggccgtg 
agcgaggctg 
aacctcaacg 
tcaccaggag 
tacaaccacc 
gtacagtttt 
tatacaggtt 
gaaatactga 
cttgaattgg 
ggcacaactt 
gtacagccta 
atatcagatt 
gaggaaggaa 
aacagccctg 
aaaaggaaaa 
tcagaaacca 
cacgttagtg 
gaatggctta 
tttgatgcat 
aggaagtggt 
aaattttatg 
ttcatgaaag 
gattccctgg 
atgatggagg 
agggagatct 
ccctctgaga 
ggccatactc 
ggtcagtttt 
ccagtacaag 
agttcattgt 
tccaaaactt 
attggggtga 
ttgacaccag 
ctcaaaaaat 
cccttggatt 
caacgaatgg 
aatggttcta 
aagcggctcc 
cgtctagatg 
cttctaaaga 
cccctgaaga 
agcaagaaga 
attataggga 
atctctctgc 
cgatgggata 
gcaggctttg 
ctcctggaat 
gggattggta 
ccagaagaat 
attgaaaaga 
gactgcatgc 
gtagagtgta 



gggccttgcc 
ctccggcgct 
ccgccgctgc 
ggcctgggcc 
gagggctgcg 
atttggtttg 
cctttgatgg 
ttgatgacag 
caaaatcaaa 
gagcaatgca 
cagtttgtga 
acgtaacaga 
agacacaagg 
ctgagagtga 
gcagtgatga 
tcaaagttgc 
gctctaggaa 
agaatacttt 
gaggtggtga 
aggaggaaaa 
ctacactcta 
ggcagattaa 
agctgtacca 
gcaactgggc 
tgcagaaggg 
cacgatgtag 
gtaggatcat 
actacagtaa 
gtgcatatgg 
cagatgatcg 
ttttatttga 
cctgttctct 
tgagaactct 
tgttacccca 
gagagaaaag 
gccttattga 
ctgacacagt 
tgctagatgc 
ctgaaggaac 
taaagcaatg 
ccatagaaga 
agcttccaga 
gtcagaacca 
agattattga 
tcatggaaga 
agacaaaaaa 
cagcctttga 
actctgatta 
acctagagaa 
ggaaccgtta 
acgagttgaa 
agttggctaa 
ggcgactgtt 
tcgcagtgtt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
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ggatgtttta ctgtgcctgg ctaactatag 
agtaattctg ttgccggaag ataccccccc 
ttgcattacg aagacttttt ttggagatga 
tgaggaagag gagcaggaaa atggcaaagc 
5 ggggggcaag tctacgctta tgagacaggc 
ttgttacgtc cctgctgaag tgtgcaggct 
tggtgcctca gacagaataa tgtcaggtga 
tgccagcata ctcatgcatg caacagcaca 
aggtactgca acatttgatg ggacggcaat 

10 gactataaaa tgtcgtacat tattttcaac 
tcaaaatgtt gctgtgcgcc taggacatat 
ccccagccag gagactatta cgttcctcta 
ctatggcttt aatgcagcaa ggcttgctaa 
tagaaaagca agagaatttg agaagatgaa 

15 cctggctagt gaaaggtcaa ctgtagatgc 
taaggaatta tagactgact acattggaag 
attcagacaa cattatgatc taataaactt 



tcgagggggt gatggtccta tgtgtcgccc 3300 
cttcttagag cttaaaggat cacgccatcc 3360 
ttttattcct aatgacattc taataggctg 3420 
ctattgtgtg cttgttactg gaccaaatat 34 80 
tggcttatta gctgtaatgg cccagatggg 3540 
cacaccaatt gatagagtgt ttactagact 3600 
aagtacattt tttgttgaat taagtgaaac 3660 
ttctctggtg cttgtggatg aattaggaag 3720 
agcaaatgca gttgttaaag aacttgctga 3780 
tcactaccat tcattagtag aagattattc 3840 
ggcatgcatg gtagaaaatg aatgtgaaga 3900 
taaattcatt aagggagctt gtcctaaaag 3960 
tctcccagag gaagt-tattc aaaagggaca 4020 
tcagtcacta cgattatttc gggaagtttg 4080 
tgaagctgtc cataaattgc tgactttgat 4140 
ctttgagttg acttctgaca aaggtggtaa 4200 
tattttttaa aaat 4244 



MSH3 (human) (SEQ ID NO:28) 

20 MSRRKPASGG LAASSSAPAR QAVLSRFFQS 

AFPPQLPPHV ATEIDRRKKR PLENDGPVKK 

VSKSLEKLKE FCCDSALPQS RVQTESLQER 

INQKDTTLFD LSQFGSSNTS HENLQKTASK 

VECGYKYRFF GEDAEIAARE LNIYCHLDHN 

25 TETAALKAIG DNRSSLFSRK LTALYTKSTL 

CISENKENVR DKKKGNIFIG IVGVQPATGE 

SALSEQTEAL IHRATSVSVQ DDRIRVERMD 

SGIVNLEKPV ICSLAAIIKY LKEFNLEKML 

NQTDMKTKGS LLWVLDHTKT SFGRRKLKKW 

30 QIENHLRKLP DIERGLCSIY HKKCSTQEFF 

TVILEIPELL SPVEHYLKIL NEQAAKVGDK 

LQEIRKILKN PSAQYVTVSG QEFMIEIKNS 

RHLNQLREQL VLDCSAEWLD FLEKFSEHYH 

TVQEERKIVI KNGRHPVIDV LLGEQDQYVP 

35 ALITIMAQIG SYVPAEEATI GIVDGIFTRM 
SLVILDELGR GTSTHDGIAI AYATLEYFIR 

MGFLVSEDES KLDPGAAEQV PDFVTFLYQI 

SKELEGLINT KRKRLKYFAK LWTMHNAQDL 



TGSLKSTSSS TGAADQVDPG AAAAAAPPAP 60 
KVKKVQQKEG GSDLGMSGNS EPKKCLRTRN 120 
FAVLPKCTDF DDISLLHAKN AVSSEDSKRQ 180 
SANKRSKSIY TPLELQYIEM KQQHKDAVLC 2 40 
FMTASIPTHR LFVHVRRLVA KGYKVGWKQ 300 
IGEDVNPLIK LDDAVNVDEI MTDTSTSYLL 3 60 
WFDSFQDSA SRSELETRMS SLQPVELLLP 4 20 
NIYFEYSHAF QAVTEFYAKD TVDIKGSQII 4 80 
SKPENFKQLS SKMEFMTING TTLRNLEILQ 54 0 
VTQPLLKLRE INARLDAVSE VLHSESSVFG 600 
LIVKTLYHLK SEFQAIIPAV NSHIQSDIiLR 660 
TELFKDLSDF PLIKKRKDEI QGVIDEIRMH 720 
AVSCIPTDWV KVGSTKAVSR FHSPFIVENY 7 80 
SLCKAVHHLA TVDCIFSLAK VAKQGDYCRP 8 40 
NNTDLSEDSE RVMIITGPNM GGKSSYIKQV 900 
GAADNIYKGR STFMEELTDT AEIIRKATSQ 960 
DVKSLTLFVT HYPPVCELEK NYSHQVGNYH 1020 
TRGIAARSYG LNVAKLADVP GEILKKAAHK 1080 
QKWTEEFNME ETQTSLLH 1128 



40 MSH3 (human DNA) (SEQ ID NO:29) 

gggcacgagc cctgccatgt ctcgccggaa gcctgcgtcg ggcggcctcg ctgcctccag 60 

ctcagcccct gcgaggcaag cggttttgag ccgattcttc cagtctacgg gaagcctgaa 120 

atccacctcc tcctccacag gtgcagccga ccaggtggac cctggcgctg cagcggccgc 180 

agcgccccca gcgcccgcct tcccgcccca gctgccgccg cacgtagcta cagaaattga 240 

45 cagaagaaag aagagaccat tggaaaatga tgggcctgtt aaaaagaaag taaagaaagt 300 

ccaacaaaag gaaggaggaa gtgatctggg aatgtctggc aactctgagc caaagaaatg 360 

tctgaggacc aggaatgttt caaagtctct ggaaaaattg aaagaattct gctgcgattc 4 20 

tgcccttcct caaagtagag tccagacaga atctctgcag gagagatttg cagttctgcc 4 80 

aaaatgtact gattttgatg atatcagtct tctacacgca aagaatgcag tttcttctga 540 

50 agattcgaaa cgtcaaatta atcaaaagga cacaacactt tttgatctca gtcagtttgg 600 

atcatcaaat acaagtcatg aaaatttaca gaaaactgct tccaaatcag ctaacaaacg 660 

gtccaaaagc atctatacgc cgctagaatt acaatacata gaaatgaagc agcagcacaa 7 20 

agatgcagtt ttgtgtgtgg aatgtggata taagtataga ttctttgggg aagatgcaga 780 

gattgcagcc cgagagctca atatttattg ccatttagat cacaacttta tgacagcaag 840 

55 tatacctact cacagactgt ttgttcatgt acgccgcctg gtggcaaaag gatataaggt 900 

gggagttgtg aagcaaactg aaactgcagc attaaaggcc attggagaca acagaagttc 960 

actcttttcc cggaaattga ctgcccttta tacaaaatct acacttattg gagaagatgt 1020 

gaatccccta atcaagctgg atgatgctgt aaatgttgat gagataatga ctgatacttc 1080 

taccagctat cttctgtgca tctctgaaaa taaggaaaat gttagggaca aaaaaaaggg 1140 

60 caacattttt attggcattg tgggagtgca gcctgccaca ggcgaggttg tgtttgatag 1200 

tttccaggac tctgcttctc gttcagagct agaaacccgg atgtcaagcc tgcagccagt 12 60 

agagctgctg cttccttcgg ccttgtccga gcaaacagag gcgctcatcc acagagccac 1320 
atctgttagt gtgcaggatg acagaattcg agtcgaaagg atggataaca tttattttga 1380 
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atacagccat gctttccagg cagttacaga gttttatgca aaagatacag ttgacatcaa 1440 
aggttctcaa attatttctg gcattgttaa cttagagaag cctgtgat?? gc?c??tqqS JsoS 

T^ 0 ^ aa 9aattcaa cttggaLag atgctctcca Jaccrgagaa 
ttttaaacag ctatcaagta aaatggaatt tatgacaatt aatggaacaa cattaaaaaa 16-0 

agacca^ct aaJac?tf t * gaCtgatat ^aaaccaaa ggaJgtttgc tgtgg^t 16sS 
agaccacact aaaacttcat ttgggagacg gaagttaaag aagtgggtga cccaaccact 1740 

TcllTtltt "SSS™ f gcccq t ct tgat ^tgtS tc g gg ll g g tL tccStcS Uti 

atctagtgtg tttggtcaga tagaaaatca tctacgtaaa ttgcccgaca taqaoaaaao 18 60 
tSaSSl C a " tat " ca aaaaatgttc tacccaagag ttrttcttga ttg?£22 
tttatatcac ctaaagtcag aatttcaagc aataatacct gctgttaatt cccacattca 1980 
?^ a ^f tg ctccggaccg ttattttag. aattcctgaa Itcctcag" cag^ggagca 20 4 S 
"^" aaag ata ^caatg aacaagctgc caaagttggg gataaaactg aattatt?aa ?100 
agacctttct gacttccctt taataaaaaa gaggaaggat gaaattcaag gtgttattqa 2160 
t 9 JT tCC f a atgCattt ^ C aagaaatacg aaaaatacta aaaaatcct? ctgcacaata 222Q 
tgtgacagta tcaggacagg agtttatgat agaaataaag aactctgctg tatcttgta? 2280 
a ^ a ^ gat tgg f aaa ^ "ggaagcac aaaagctgtg agccgctttc actctcct?t 234^ 
tattgtagaa aattacagac atctgaatca gctccgggag cagctagtcc ttgactgcaq 2400 
tgctgaatgg cttgattttc tagagaaatt cagtgaacat tatcac?cct tgtgSHgc 24 60 
agtgcatcac ctagcaactg ttgactgcat tttctccctg gccaaggtcg ctaagcaagg All 
agattactgc agaccaactg tacaagaaga aagaaaaatt gtaataaaaa atgqlaqqcl 2580 
gatgtgttgC ^ggagaaca ggatcaatat gtcccaaata atfcaga?tt 26,0 
atcagaggac tcagagagag taatgataat taccggacca aacatgggtg gaaagagctc 2700 
ctacataaaa caagttgcat tgattaccat catggctcag attggllcc? Itgt?cc?qc 2760 
? gaagaagCg aca ^ttggga ttgtggatgg cattttcaca aggatgggtg ctgcagacaa 2B20 
tatatataaa ggacggagta catttatgga agaactgact gacacagcag aaitaltcaq 2880 
aaaagcaaca tcacagtcct tggttatctt ggatgaacta ggaagagggf cgagSctS 2SA0 
tgatggaatt gccattgcct atgctacact tgagtatttc atcagaf^g tjaatcctt 3000 
aaccctgttt gtcacccatt atccgccagt ttgtgaacta gaaalaaatt aSt^acacca 3060 
ggtggggaat tacoacatgg gattcttggt cagtgaggat gaaagcaaac Jggatccag* 3?20 
cgcagcagaa caagtccctg attttgtcac cttcctttac Laataacta glggwSg? llll 
agcaaggagt tatggattaa atgtggctaa actagcagat gttcctggag LSttttaS 3240 
caaata 9 ?^ CaCaa ^ aa -Wctgga aggattaata ZatacgSaZ gaaagagfc* Hoi 
caagtatttt gcaaagttat ggacgatgca taatgcacaa gacctgcaga Igtggacaqa 3360 
35 lilt ° a ^9^gaaa cacagacttc tcttcttcat taaaalgalg actacaSg" llto 
Strtt?^ a ^ ggagaatt aaaaatacca actgtacaaa ataactctcc agtJacagcl 3480^ 
tatctttgtg tgacatgtga gcataaaatt atgaccatgg tatattccta ttaoa™ 

ss: nsss?2 tct s^ aa tcctaS* tiziiTttii Ho? 

aa "^f" g aata g a <=ttc cactttgtaa ttagaaaatt ttatggacag taagtccagt 3660 
40 iStE"?^ a ^ aattc = ca agcttttgga gggtgltatJ aaaat?S£ lllo 

t 9 " S^ a ?^ aa ttg 9 caac tg ggtgaatctg gcaggaatct 3780 

tttttatlaa Sa*™ t "tattatgc aaccagttta tccaccaaga acataagaat 3840 
tttttataag tagaaagaat tggccaggca tggtggctca tgcctgtaat cccaqcactt 3900 
tgggaggcca aggtaggcag atcacctgag gtcaggagtt caagaccagc rtggccaaca" 39^0 
45 Sqctaaac tZTr,^* 0 taaaaatata aagtacatct ctaStaaaL tacgaaaaaa 4ol2 
" ag ^ gggC ^g^cgc acacctgtag tcccagctac tccggaggct gagfcaggaq 4080 
aatctcttga acctgggagg cggaggttgc aatgagccga gatcacgtca ctgcactcca 414 0 

^ ga l" aga f CCatCtCa ««-g««« aagaaalgaa aS Sta 20^ 
? aa ?^ ta aaaacta gag cacagaagga ataaggtcat gaaatttaaa aggttaaata 4260 

50 tlltltltll a SS2«S " aaaga " 9 "^atgaaa ttatttgtca ttSttSjg JUS 
taataaatat ttaatgaata cttgctataa aaaaaaaaaa aaaaaaaaaa aaaa 4374 

Each reference cited herein is hereby incorporated by reference in its entirety. 
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We claim: 

1 . A method for making a hypennutable cell comprising exposing a cell to an inhibitor 
of mismatch repair, wherein said inhibitor is an anthracene, an ATPase inhibitor, a nuclease 
inhibitor, a polymerase inhibitor, or an antisense oligonucleotide that specifically hybridizes 
to a nucleotide encoding a mismatch repair protein. 

2. The method of claim 1 wherein said inhibitor is an anthracene. 

3. The method of claim 2 wherein said anthracene has the formula: 




wherein R^Rio ^ independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyL, S-alkynyl, N-alkynyL, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, N0 2? an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups. 
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4. The method of claim 3 wherein R 5 and P^ are hydr* 



ogen. 



5. The method of claim 3 wherein Rl -R I0 are independently hydrogen, hydroxyl, alkyl, 
aryl, arylaklyl, or hydroxyalkyl. 

6. The method of claim 3 wherein R r R 10 are independently hydrogen, hydroxyl, methyl, 
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or 
hydroxybutyl. 

7. The method of claim 3 wherein said anthracene is selected from the group consisting 
of 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 9,10- 
diphenylanthracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl-lO- 
methylanthracene, dimethylanthracene-l,2-diol, 9.hyd r o X ymeth y l-10-methylanthracen e -l,2- 
diol, P-hydroxymemyl-lO-memylanthracene-S^diol, and 9, 10-di-m-tolyanthracene. 

8. The method of claim 3 wherein R 3 , R,, R,, R 6 , R? , ^ ^ ^ m hydrogen 

9. The method of claim 3 wherein R„ R 2 , R 3 , R 4 , R 5 , r 6> R?and Rg m hydrogen 

10. The method of claim 3 wherein R l5 R 2 , R 35 R 4 , R 5 , r 6j r? md Rg m hydrogen 

11. The method of claim 3 wherein R] , R 2 , R 3 , R,, r,, r 6> r, ^ ^ m hydrogen 

12. The method of claim 3 wherein R„ R 2 , R 3 , R 4 , R 6> r ? ^ Rg m hydrogen 

13. The method of claim 3 wherein R„ R 2 , R 3 , R,, R,, r ? Rg ^ R]o3re hydrogen . 

14. The method of claim 1 wherein said ATPase inhibitor is nonhydrolyzable forms of 
ATP such as AMP-PNP. 

15. The method of claim 1 wherein said a nuclease inhibitor is an analog of N- 
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Ethylmaleimide, a heterodimeric adenine-chain-acridine compounds, or a quinilone such 
as Heliquinomycin. 

16. The method of claim 1 wherein said polymerase inhibitor is an analog of 
aphidicolin, 1 -(2'-Deoxy-2 f -fluoro-beta-L-arabinofuranos3d)-5-methyluracil (L-FM AU) or 
2 t ,3 , -dideoxyribonucleoside S'-triphosphates. 

17. The method of claim 1 wherein said antisense ohgonucleotide comprises about 15 
consecutive nucleotides that are complementary to the coding strand of a mismatch repair 
protein, wherein said antisense oligonucleotide specifically binds to said coding strand of 
said mismatch repair protein under physiological conditions and inhibits mismatch repair 
activity of said mismatch repair protein. 

18. The method of claim 17 wherein said antisense oligonucleotide specifically binds 
to a regulatory portion on said coding strand of said mismatch repair protein. 

19. The method of claim 17 wherein said antisense oligonucleotide is directed against 
the first six codons of a MMR gene message. 

20. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a eukaryotic cell in vitro. 

21 . The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a prokaryotic cell in vitro. 

22. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a plant. 

23 . A method for generating a mutation in a gene of interest comprising exposing a cell 
comprising said gene of interest to a chemical mismatch repair inhibitor and testing said 
cell to determine whether said gene of interest comprises a mutation. 
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24. The method of claim 23 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

25. The method of claim 23 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 



26. The method of claim 23 wherein said testing comprises analyzing the phenotype of 
said cell. 



27. The method of claim 23 wherein said cell is a mammalian cell and wherein said 
mammalian cell is made mismatch repair defective by exposing said mammalian cell to an 
inhibitor of mismatch repair. 

28. The method of claim 27 further comprising removing the chemical inhibitor of 
mismatch repair after detennining that said gene of interest comprises a mutation. 

29. The method of claim 27 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

30. The method of claim 27 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 



31. The method of claim 27 wherein said testing comprises analyzing the phenotype of 
said cell. 



32. A method for generating a mutation in a gene of interest comprising exposing an 
animal to a chemical inhibitor of mismatch repair and testing said animal to determine 
whether the gene of interest comprises a mutation. 

3 3 . The method of claim 32 wherein said animal is a mammal. 
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34. The method of claim 32 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

35. The method of claim 32 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 

36. The method of claim 32 wherein said testing comprises analyzing the phenotype of 
said cell. 

37. The method of claim 33 wherein said mammal is made mismatch repair defective 
by exposing said mammal to an inhibitor of mismatch repair. 

38. The method of claim 37 further comprising removing said inhibitor of mismatch 
repair after determining that said gene of interest comprises a mutation. 

39. A hypennutable transgenic mammal made by the method of claim 33. 

40. A method for generating a mismatch repair defective plant comprising exposing 
said plant to an inhibitor of mismatch repair. 

41 . A method for generating a mutation in a gene of interest comprising growing a 
plant comprising said gene of interest, exposing said plant to an inhibitor of mismatch 
repair, and testing said plant to determine whether said gene of interest comprises a 
mutation. 

42. The method of claim 41 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

43. The method of claim 41 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 

44. The method of claim 41 wherein said testing comprises analyzing the phenotype of 
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said plant. 

45. The method of claim 41 wherein said plant is made mismatch repair defective by 
exposing said plant to an inhibitor of mismatch repair. 

46. A hypermutable plant made by the method of claim 40. 

47. The plant of claim 46 wherein said plant is monocot. 

48. The plant of claim 46 wherein said plant is dicot. 

49. A method for screening for chemical inhibitors of mismatch repair comprising 
exposing an organism to a candidate compound and screening the DNA of said organism 
for microsatellite instability, 

50. The method of claim 49 wherein said organism is a mammal. 

5 1 . The method of claim 49 wherein said organism is a microbe. 

52. The method of claim 49 wherein said organism is a plant. 

53. The method of claim 49 wherein said screening comprises monitoring endogenous 
microsatellites. 

54. The method of claim 49 wherein said screening comprises the use of reporter 
expression genes, wherein said reporter expression genes comprise polynucleotide repeats 
within a coding region of said reporter gene. 

55. The method of claim 54 wherein said reporter gene is p-glucuronidase. 

56. A method for blocking mismatch repair activity in vivo comprising exposing a cell 
to an anthracene compound. 
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57. 



The method of claim 56 wherein said anthracene comprises the formula: 




wherein R]-Rio are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfiir compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups. 

58. The method of claim 57 wherein and are hydrogen. 

59. The method of claim 57 wherein R^Rjq are independently hydrogen, hydroxyl, alkyl, 
aryl, arylaklyl, or hydroxyalkyl. 
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60. The method of claim 57 wherein R,-R 10 are independently hydrogen, hydroxy], methyl, 
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or 
hydroxybutyL 

61. The method of claim 57 wherein said anthracene is selected from the group consisting 
of 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 9,10- 
diphenylanthracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl-10- 
methylantbracene,dimemylanthracene-l,2-diol,9-hydroxymemyl-10-memyla^ 

diol, 9-hydroxymethyl-10-methylanthracene-3,4-diol, and 9, 10-di-m-toly anthracene. 
R 3 , R 4 , 

62. The method of claim 57 wherein R 3 , R 4 , R,, R,, R 7 , R 8 , R, an d R 10 are hydrogen. 

63. The method of claim 57 wherein R„ R,, R 3 , R 4 , R,, R,, R? and R* are hydrogen. 

64. The method of claim 57 wherein R l5 R,, R 3 , R,, R,, ^ R? and R« are hydrogen. 

65. The method of claim 57 wherein R„ R,, R 3 , R 4 , R,, R,, R, and R 10 are hydrogen. 

66. The method of claim 57 wherein R„ R 2 , R 3 , R 4 , R 5 , R 6 , R? and R 8 are hydrogen. 

67. The method of claim 57 wherein R„ R 2 , R 3 , R 4 , R,, R 6 , R 7 Rg R I0 are hydrogen. 

68. The method of claim 23 further comprising exposing said cell to a mutagen. 

69. The method of claim 32 further comprising exposing said animal to a mutagen. 

70. The method of claim 68 or 69 wherein said mutagen is selected from the group 
consisting of N-memyl-N'-mtro-N-nitrosoguanidine, methane sulfonate, dimethyl 
sulfonate, O-6-methyl benzadine, ethyl methanesulfonate, methylnitrosourea, and 
ethylnitrosourea. 
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71 . The method of claim 49 wherein the chemcial is a MMR inhibitor wherein it 
induces microsatellite instability in MMR proficient cells but does not induce enhanced 
microsattelite instability in MMR deficient cells. 
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SEQUENCE LISTING 

<110> Nicolaides, Nicholas C 
Grasso, Luigi 
Sass, Philip M 

<120> CHEMICAL INHIBITORS OF MISMATCH REPAIR 
<130> MOR-0018 

<140> 00/000,000 
<141> 2001-01-15 

<160> 44 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 1 

ggatcctaat acgactcact atagggagac caccatgtcg ttcgtggcag gg 52 

<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 2 

taagtcttaa gtgctaccaa c 



<210> 3 
<211> 53 
<212> DNA 

<213> Artificial Sequence 



1 
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<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 3 

ggatcctaat acgactcact atagggagac caccatggaa caattgcctg egg 53 



<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 4 

aggttagtga agactctgtc 20 



<210> 5 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 5 

ctgatctcac ggacaatagt gc 22 



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<4 00> 6 

ggctccataa aaagtgcacc 20 



<210> 7 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 7 

ggtctgttga tgtcgtaagt eg 



<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 8 

atcttgaaac ctttagggag gg 

22 



<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : oligonucleotide 

primer 

<400> 9 

agaagtttag acaggtac 



<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 10 



18 



3 
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13 



<210> 11 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 11 

cccggatcca tgttaaaaaa aaaaaaaaaa aaaaaacgtc ctgtagaaac c 51 

<210> 12 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 12 

cccggatcca tgttaaaaaa aaaaaaaaaa aaaacgtcct gtagaaac 4 8 

<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 13 

cccgaattcc ccgatctagt aacatagatg 30 

<210> 14 

<211> 859 

<212> PRT 

<213> Mus musculus 

<400> 14 

4 
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5 10 15 

Pro lie Asp Gly Lys Ser Val His Gin lie Cys Ser Gly Gin Val He 

25 30 

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu He Glu Asn Ser Val Asp 
35 45 

Ala Gly Ala Thr Thr He Asp Leu Arg Leu Lys Asp Tyr Gly Val Asp 



55 60 



Leu lie Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 
DO 70 



75 



80 



Glu Gly Leu Ala Leu Lys His His Thr Ser Lys He Gin Glu Phe Ala 

85 90 95 

Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser 

Ser Leu Cys Ala Leu Ser Asp Val Thr He Ser Thr Cys His Gly Ser 
115 120 125 

Ala Ser Val Gly Thr Arg Leu Val Phe Asp His Asn Gly Lys He Thr 

135 140 

Gin Lys Thr Pro Tyr Pro Arg Pro Lys Gly Thr Thr Val Ser Val Gin 

150 160 
His Leu Phe Tyr Thr Leu Pro Val Arg Tyr Lys Glu Phe Gin Arg Asn 
165 "0 175 

He Lys Lys Glu Tyr Ser Lys Met Val Gin Val Leu Gin Ala Tyr Cys 
180 "5 190 

He lie Ser Ala Gly Val Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 
195 200 205 

Gly Lys Arg His Ala Val Val Cys Thr Ser Gly Thr Ser Gly Met Lys 

215 220 

Glu Asn lie Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu Il e 

230 235 240 

Pro Phe Val Gin Leu Pro Pro Ser Asp Ala Val Cys Glu Glu Tyr Gly 



245 250 255 
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Leu Ser Thr Ser Gly Arg His Lys Thr Phe Ser Thr Phe Arg Ala Ser 
260 265 270 

Phe His Ser Ala Arg Thr Ala Pro Gly Gly Val Gin Gin Thr Gly Ser 
275 280 285 

Phe Ser Ser Ser lie Arg Gly Pro Val Thr Gin Gin Arg Ser Leu Ser 
290 295 300 

Leu Ser Met Arg Phe Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 
305 310 315 320 

Val Val Leu Asn Val Ser Val Asp Ser Glu Cys Val Asp lie Asn Val 
325 330 335 

Thr Pro Asp Lys Arg Gin lie Leu Leu Gin Glu Glu Lys Leu Leu Leu 
340 345 350 

Ala Val Leu Lys Thr Ser Leu lie Gly Met Phe Asp Ser Asp Ala Asn 
355 360 365 

Lys Leu Asn Val Asn Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 
370 375 380 

Val Lys Leu His Thr Ala Glu Leu Glu Lys Pro Val Pro Gly Lys Gin 
385 390 395 400 

Asp Asn Ser Pro Ser Leu Lys Ser Thr Ala Asp Glu Lys Arg Val Ala 
405 410 415 

Ser lie Ser Arg Leu Arg Glu Ala Phe Ser Leu His Pro Thr Lys Glu 
420 425 430 

lie Lys Ser Arg Gly Pro Glu Thr Ala Glu Leu Thr Arg Ser Phe Pro 
435 440 445 

Ser Glu Lys Arg Gly Val Leu Ser Ser Tyr Pro Ser Asp Val lie Ser 
450 455 460 

Tyr Arg Gly Leu Arg Gly Ser Gin Asp Lys Leu Val Ser Pro Thr Asp 
465 470 475 480 

Ser Pro Gly Asp Cys Met Asp Arg Glu Lys lie Glu Lys Asp Ser Gly 
485 490 495 

Leu Ser Ser Thr Ser Ala Gly Ser Glu Glu Glu Phe Ser Thr Pro Glu 
500 505 510 



6 
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Val Ala Ser Ser Phe Ser Ser Asp Tyr Asn Val Ser Ser Leu Glu Asp 

DID con 

szo 525 

Arg Pro Ser Gin Glu Thr He Asn Cys Gly Asp Leu Asp Cys Arg PrQ 

535 540 

Pro Gly Thr Gly Gin Ser Leu Lys Pro Glu Asp His Gly Tyr Gin Cys 

550 "5 560 

Lys Ala Leu Pro Leu Ala Arg Leu Ser Pro Thr Asn Ala Lys Arg Phe 



565 



570 



575 



Lys Thr Glu Glu Arg Pro Ser Asn Val Asn He Ser Gin Arg Leu Pro 



580 



585 



590 



Gly Pro Gin Ser Thr Ser Ala Ala Glu Val Asp Val Ala He Lys Met 
595 600 605 

Asn Lys Arg He Val Leu Leu Glu Phe Ser Leu Ser Ser Leu Ala Lys 

615 620 

Arg Met Lys Gin Leu Gin His Leu Lys Ala Gin Asn Lys His Glu Leu 

630 635 640 

Ser Tyr Arg Lys Phe Arg Ala Lys H e C ys Pro Gly Glu Asn Gin Ala 



645 



650 



655 



Ala Glu Asp Glu Leu Arg Lys Glu He Ser Lys Ser Met Phe Ala Glu 



660 



665 



670 



Met Glu lie Leu Gly Gin Phe Asn Leu Gly P he He Val Thr Lys Leu 



680 



685 



Lys Glu Asp Leu Phe Leu Val Asp Gin His Ala Ala Asp Glu Lys Tyr 

695 700 

Asn Phe Glu Met Leu Gin Gin His Thr Val Leu Gin Ala Gin Arg Leu 

10 715 720 

He Thr Pro Gin Thr Leu Asn Leu Thr Ala Val Asn Glu Ala Val Leu 
725 730 735 

He Glu Asn Leu Glu lie Phe Arg Lys Asn Gly Phe Asp Phe Val ll e 
740 745 750 

Asp Glu Asp Ala Pro Val Thr Glu Arg Ala Lys Leu He Ser Leu Pro 
755 ? 60 765 
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Thr Ser Lys Asn 
770 

Phe Met Leu Ser 
785 

Arg Gin Met Phe 



Thr Ala Leu Asn 
820 

Glu Met Asp His 
835 

His Val Ala Asn 
850 



Trp Thr Phe Gly 
775 

Asp Ser Pro Gly 
790 

Ala Ser Arg Ala 
805 

Ala Ser Glu Met 



Pro Trp Asn Cys 
840 

Leu Asp Val lie 
855 



Pro Gin Asp He 
780 

Val Met Cys Arg 
795 

Cys Arg Lys Ser 
810 

Lys Lys Leu He 
825 

Pro His Gly Arg 



Ser Gin Asn 



Asp Glu Leu He 



Pro Ser Arg Val 
800 

Val Met lie Gly 
815 

Thr His Met Gly 
830 

Pro Thr Met Arg 
845 



<210> 15 
<211> 3056 
<212> DNA 

<213> Mus musculus 
<400> 15 

gaattccggt gaaggtcctg aagaatttcc 
taacctgtcg tcaggtaacg atggtgtata 
gtcttttccc gagagcggca ccgcaactct 
catccatgga gcaaaccgaa ggcgtgagta 
atgggaagtc agtccatcaa atttgttctg 
tgaaggagtt gatagaaaat agtgtagatg 
aagactatgg ggtggacctc attgaagttt 
actttgaagg tctagctctg aaacatcaca 
cgcaggttga aactttcggc tttcgggggg 
atgtcactat atctacctgc cacgggtctg 
ataatgggaa aatcacccag aaaactccct 
tgcagcactt attttataca ctacccgtgc 
aggagtattc caaaatggtg caggtcttac 
gtgtaagctg cactaatcag ctcggacagg 
gcacgtctgg catgaaggaa aatatcgggt 
tcattccttt tgttcagctg ccccctagtg 
cttcaggacg ccacaaaacc ttttctacgt 
cgccgggagg agtgcaacag acaggcagtt 
agcaaaggtc tctaagcttg tcaatgaggt 
catttgtcgt ccttaacgtt tccgttgact 
ataaaaggca aattctacta caagaagaga 
tgataggaat gtttgacagt gatgcaaaca 
atgttgaagg taacttagta aagctgcata 



agattcctga gtatcattgg aggagacaga 60 
tgcaacagaa atgggtgttc ctggagacgc 120 
cccgcggtga ctgtgactgg aggagtcctg 180 
cagaatgtgc taaggccatc aagcctattg 24 0 
ggcaggtgat actcagttta agcaccgctg 300 
ctggtgctac tactattgat ctaaggctta 360 
cagacaatgg atgtggggta gaagaagaaa 420 
catctaagat tcaagagttt gccgacctca 480 
aagctctgag ctctctgtgt gcactaagtg 540 
caagcgttgg gactcgactg gtgtttgacc 600 
acccccgacc taaaggaacc acagtcagtg 660 
gttacaaaga gtttcagagg aacattaaaa 720 
aggcgtactg tatcatctca gcaggcgtcc 780 
ggaagcggca cgctgtggtg tgcacaagcg 840 
ctgtgtttgg ccagaagcag ttgcaaagcc 900 
acgctgtgtg tgaagagtac ggcctgagca 960 
ttcgggcttc atttcacagt gcacgcacgg 1020 
tttcttcatc aatcagaggc cctgtgaccc 1080 
tttatcacat gtataaccgg catcagtacc 1140 
cagaatgtgt ggatattaat gtaactccag 1200 
agctattgct ggccgtttta aagacctcct 1260 
agcttaatgt caaccagcag ccactgctag 1320 
ctgcagaact agaaaagcct gtgccaggaa 1380 
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™t:i zzizii: iTcZir T°r cga 9 — 999ta "«° 

a I y a 99 cc1: tt tctcttcatc ctactaaaga gatcaagtct aggggtccaq 1500 

TtcZtii ::tT:: gg agtttt — gggcgtgtta t iT ctta a t i 

cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620 
cggacagccc tggtgactgt atggacagag agaaaataga aaaagactc ggjctcagca 680 
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttajca 1740 
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccaJa aLtrtgrtq 1800 
acctggactg ccgt^tcca ggtacaggac agtccttgaa gccagaagac catggltatc I860 
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc "c«^«g Ht 0 
aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag a i 980 
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc ga^ttctctc 2040 

aa^tgaltta ZtTTrl t.«g«cct aaaggcgcag a'acaaacatg 21 0 

aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160 
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgg^tcagt -20 
ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gacclocatg T 28 0 
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcag! 2340 

atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 2460 
ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt 
atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 
gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatjatt glHcggcgc tell 
IctacccT 9 C9agatgaag -^catca cccacatggg tgagatggac caccc 270 
actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcLa 2760 

tTti:::::: :TTr t tcg g gtt 9 tgc aa :z 9 :z nil 

ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880 
tgatccaat. *«W«tga tggagtgttc ctctagctca gctact'tggg 2 0 

ajactcaltt llTa ' «*ttg.g.c cactccgagc cacattcatg 30 0 

agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 3056 



<210> 16 

<211> 862 

<212> PRT 

<213> Homo sapiens 



<400> 16 

Met Glu Arg Ala Glu Ser Ser Ser Thr Glu Pro Ala Lys Ala He Lys 



10 



15 



Pro lie Asp Arg Lys Ser Val His Gin He Cys Ser Gly Gin Val Val 
20 25 30 

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp 
35 «0 45 

Ala Gly Ala Thr Asn He Asp Leu Lys Leu Lys Asp Tyr Gly 



55 



Val Asp 



60 



9 
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Leu lie Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 
65 70 75 80 

Glu Gly Leu Thr Leu Lys His His Thr Ser Lys lie Gin Glu Phe Ala 
85 90 95 

Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser 
100 105 110 

Ser Leu Cys Ala Leu Ser Asp Val Thr lie Ser Thr Cys His Ala Ser 
115 120 125 

Ala Lys Val Gly Thr Arg Leu Met Phe Asp His Asn Gly Lys lie lie 
130 135 140 

Gin Lys Thr Pro Tyr Pro Arg Pro Arg Gly Thr Thr Val Ser Val Gin 
145 150 155 160 

Gin Leu Phe Ser Thr Leu Pro Val Arg His Lys Glu Phe Gin Arg Asn 
165 170 175 

lie Lys Lys Glu Tyr Ala Lys Met Val Gin Val Leu His Ala Tyr Cys 
180 185 190 

lie He Ser Ala Gly lie Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 
1S5 200 205 

Gly Lys Arg Gin Pro Val Val Cys Thr Gly Gly Ser Pro Ser He Lys 
210 215 220 

Glu Asn He Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu He 
225 230 235 240 

Pro Phe Val Gin Leu Pro Pro Ser Asp Ser Val Cys Glu Glu Tyr Gly 
245 250 255 

Leu Ser Cys Ser Asp Ala Leu His Asn Leu Phe Tyr He Ser Gly Phe 
260 265 270 

He Ser Gin Cys Thr His Gly Val Gly Arg Ser Ser Thr Asp Arg Gin 
275 280 285 

Phe Phe Phe He Asn Arg Arg Pro Cys Asp Pro Ala Lys Val Cys Arg 
290 295 300 

Leu Val Asn Glu Val Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 
305 310 315 320 



10 
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Val Val Leu Asn He Ser Val Asp Ser Glu Cys Val Asp He Asn Val 
325 33 0 335 

Thr Pro Asp Lys Arg Gin lie Leu Leu Gin Glu Glu Lys Leu Leu Leu 
340 345 350 

Ala Val Leu Lys Thr Ser Leu lie Gly Met Phe Asp Ser Asp Val Asn 
355 360 3e5 

Lys Leu Asn Val Ser Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 
370 375 380 

He Lys Met His Ala Ala Asp Leu Glu Lys Pro Met Val Glu Lys Gin 

385 390 -joe 

395 400 

Asp Gin Ser Pro Ser Leu Arg Thr Gly Glu Glu Lys Lys Asp Val Ser 
405 410 4 15 

He Ser Arg Leu Arg Glu Ala Phe Ser Leu Arg His Thr Thr Glu Asn 
420 425 43Q 

Lys Pro His Ser Pro Lys Thr Pro Glu Pro Arg Arg Ser Pro Leu Gly 
435 440 445 

Gin Lys Arg Gly Met Leu Ser Ser Ser Thr Ser Gly Ala He Ser Asp 
450 455 4 60 

Lys Gly Val Leu Arg Pro Gin Lys Glu Ala Val Ser Ser Ser His Gly 

465 470 /-7C 

* /u 475 480 

Pro Ser Asp Pro Thr Asp Arg Ala Glu Val Glu Lys Asp Ser Gly His 
485 490 495 

Gly Ser Thr Ser Val Asp Ser Glu Gly Phe Ser He Pro Asp Thr Gly 
500 505 510 

Ser His Cys Ser Ser Glu Tyr Ala Ala Ser Ser Pro Gly Asp Arg Gly 
515 520 525 

Ser Gin Glu His Val Asp Ser Gin Glu Lys Ala Pro Glu Thr Asp Asp 
530 535 54Q 

Ser Phe Ser Asp Val Asp Cys His Ser Asn Gin Glu Asp Thr Gly Cys 
545 550 555 560 

Lys Phe Arg Val Leu Pro Gin Pro Thr Asn Leu Ala Thr Pro Asn 



565 57 0 



Thr 
575 



11 



BNSDOCID: <WO 020S48S8A1 J_» 



WO 02/054856 



PCT/U SO 1/00934 



Lys Arg Phe Lys 
580 

Lys Leu Val Asn 
595 

Val Lys lie Asn 
610 

Leu Ala Lys Arg 
625 

Gly Glu Gin Asn 



Asn Gin Ala Ala 
660 

Phe Ala Glu Met 
675 

Thr Lys Leu Asn 
690 

Glu Lys Tyr Asn 
705 

Gin Arg Leu lie 



Ala Val Leu lie 
740 

Phe Val lie Asp 
755 

Ser Leu Pro Thr 
770 

Glu Leu lie Phe 
785 

Ser Arg Val Lys 



Met lie Gly Thr 
820 



Lys Glu Glu lie 



Thr Gin Asp Met 
600 

Lys Lys Val Val 
615 

lie Lys Gin Leu 
630 

Tyr Arg Lys Phe 
645 

Glu Asp Glu Leu 



Glu He He Gly 
680 

Glu Asp lie Phe 
695 

Phe Glu Met Leu 
710 

Ala Pro Gin Thr 

725 

Glu Asn Leu Glu 



Glu Asn Ala Pro 
760 

Ser Lys Asn Trp 
775 

Met Leu Ser Asp 
790 

Gin Met Phe Ala 
805 

Ala Leu Asn Thr 



Leu Ser Ser Ser 
585 

Ser Ala Ser Gin 



Pro Leu Asp Phe 
620 

His His Glu Ala 
635 

Arg Ala Lys He 
650 

Arg Lys Glu He 
665 

Gin Phe Asn Leu 



He Val Asp Gin 
700 

Gin Gin His Thr 
715 

Leu Asn Leu Thr 
730 

He Phe Arg Lys 
745 

Val Thr Glu Arg 



Thr Phe Gly Pro 
780 

Ser Pro Gly Val 
795 

Ser Arg Ala Cys 
810 

Ser Glu Met Lys 
825 



Asp He Cys Gin 
590 

Val Asp Val Ala 
605 

Ser Met Ser Ser 



Gin Gin Ser Glu 
640 

Cys Pro Gly Glu 
655 

Ser Lys Thr Met 
670 

Gly Phe He He 
685 

His Ala Thr Asp 



Val Leu Gin Gly 
720 

Ala Val Asn Glu 
735 

Asn Gly Phe Asp 
750 

Ala Lys Leu He 
765 

Gin Asp Val Asp 



Met Cys Arg Pro 
800 

Arg Lys Ser Val 
815 

Lys Leu lie Thr 
830 
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His Met Gly Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro 
835 8 4o 845 



Thr Met Arg His lie Ala Asn Leu Gly Val lie Ser Gin Asn 
850 855 860 



<210> 17 
<211> 2771 
<212> DNA 

<213> Homo sapiens 
<400> 17 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 4 20 
actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 4 80 
agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 
tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 
atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 
cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 
cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 
gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840 
atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900 
aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 
tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 
gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 
gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 
agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 
gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 12 60 
aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 
aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1380 
atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 14 4 0 
gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 
gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 15 60 
agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620 
gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1680 
tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740 
accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 
aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 18 60 
aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 19-0 
catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 
tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 
tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 

13 



BNSDOCID: <WO 02054856A1J_> 



WO 02/054856 



PCT/U S0 1/00934 



gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160 

cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 

gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280 

tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 

agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 

agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 24 60 

cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 

cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 

atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640 

tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 

atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2760 

cttttcaaac c 2771 



<210> 18 

<211> 932 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

lie lie Thr Ser Val Val Ser Val Val Lys Glu Leu lie Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 

Phe Asp Lys lie Glu Val Arg Asp Asn Gly Glu Gly lie Lys Ala Val 
50 55 60 

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys lie Asn Ser 
65 70 75 80 

His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
85 90 95 

Leu Gly Ser lie Cys Cys lie Ala Glu Val Leu lie Thr Thr Arg Thr 
100 105 110 

Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 

lie Leu Ser Gin Lys Pro Ser His Leu Gly Gin Gly Thr Thr Val Thr 
130 135 140 

Ala Leu Arg Leu Phe Lys Asn Leu Pro Val Arg Lys Gin Phe Tyr Ser 

14 



BNSDOCID: <WO 02054856A1_I_> 



WO 02/054856 

PCT/USO 1/00934 

145 150 155 160 



Thr Ala Lys Lys Cys Lys Asp Glu He Lys Lys He Gin Asp Leu Leu 

175 



16 5 170 



Met Ser Phe Gly He Leu Lys Pro Asp Leu Arg He Val Phe Val 



180 185 



His 



190 



Asn Lys Ala Val He Trp Gin Lys Ser Arg Val Ser Asp His Lys Met 
195 200 205 

Ala Leu Met Ser Val Leu Gly Thr Ala Val Met Asn Asn Met Glu Ser 
210 215 220 

Phe Gin Tyr His Ser Glu Glu Ser Gin He Tyr Leu Ser Gly Phe Leu 



225 230 235 



240 



Pro Lys Cys Asp Ala Asp His Ser Phe Thr Ser Leu Ser Thr Pro Glu 
245 250 255 

Arg Ser Phe He Phe He Asn Ser Arg Pro Val His Gin Lys Asp lie 
260 265 270 

Leu Lys Leu He Arg His His Tyr Asn Leu Lys Cys Leu Lys Glu Ser 
275 280 285 

Thr Arg Leu Tyr Pro Val Phe Phe Leu Lys He Asp Val Pro Thr Ala 
290 295 300 

Asp Val Asp Val Asn Leu Thr Pro Asp Lys Ser Gin Val Leu Leu Gin 
305 310 315 3 2 o 

Asn Lys Glu Ser Val Leu He Ala Leu Glu Asn Leu Met Thr Thr Cys 
325 330 335 

Tyr Gly Pro Leu Pro Ser Thr Asn Ser Tyr Glu Asn Asn Lys Thr Asp 



340 



345 



350 



Val Ser Ala Ala Asp He Val Leu Ser Lys Thr Ala Glu Thr Asp Val 
355 360 365 

Leu Phe Asn Lys Val Glu Ser Ser Gly Lys Asn Tyr Ser Asn Val Asp 
370 375 380 

Thr Ser Val He Pro Phe Gin Asn Asp Met His Asn Asp Glu Ser Gly 
385 390 395 400 

Lys Asn Thr Asp Asp Cys Leu Asn His Gin He Ser He Gly Asp Phe 

15 
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405 410 415 

Gly Tyr Gly His Cys Ser Ser Glu lie Ser Asn lie Asp Lys Asn Thr 
420 425 430 

Lys Asn Ala Phe Gin Asp lie Ser Met Ser Asn Val Ser Trp Glu Asn 
435 440 445 

Ser Gin Thr Glu Tyr Ser Lys Thr Cys Phe lie Ser Ser Val Lys His 
450 455 460 

Thr Gin Ser Glu Asn Gly Asn Lys Asp His lie Asp Glu Ser Gly Glu 
465 470 475 480 

Asn Glu Glu Glu Ala Gly Leu Glu Asn Ser Ser Glu lie Ser Ala Asp 
485 490 495 

Glu Trp Ser Arg Gly Asn lie Leu Lys Asn Ser Val Gly Glu Asn lie 
500 505 510 

Glu Pro Val Lys lie Leu Val Pro Glu Lys Ser Leu Pro Cys Lys Val 
515 520 525 

Ser Asn Asn Asn Tyr Pro lie Pro Glu Gin Met Asn Leu Asn Glu Asp 
530 535 540 

Ser Cys Asn Lys Lys Ser Asn Val lie Asp Asn Lys Ser Gly Lys Val 
545 550 555 560 

Thr Ala Tyr Asp Leu Leu Ser Asn Arg Val lie Lys Lys Pro Met Ser 
565 570 575 

Ala Ser Ala Leu Phe Val Gin Asp His Arg Pro Gin Phe Leu lie Glu 
580 585 590 

Asn Pro Lys Thr Ser Leu Glu Asp Ala Thr Leu Gin lie Glu Glu Leu 
595 600 605 

Trp Lys Thr Leu Ser Glu Glu Glu Lys Leu Lys Tyr Glu Glu Lys Ala 
610 615 620 

Thr Lys Asp Leu Glu Arg Tyr Asn Ser Gin Met Lys Arg Ala lie Glu. 
625 630 635 640 

Gin Glu Ser Gin Met Ser Leu Lys Asp Gly Arg Lys Lys lie Lys Pro 
645 650 655 

Thr Ser Ala Trp Asn Leu Ala Gin Lys His Lys Leu Lys Thr Ser Leu 
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660 



665 670 



Ser Asn Gin Pro Lys Leu Asp Glu Leu Leu Gin Ser Gin He Glu Lys 
675 680 685 

Arg Arg Ser Gin Asn He Lys Met Val Gin He Pro Phe Ser Met Lys 
690 695 700 

Asn Leu Lys He Asn Phe Lys Lys Gin Asn Lys Val Asp Leu Glu Glu 



705 710 7 i5 



720 



Lys Asp Glu Pro Cys Leu He His Asn Leu Arg Phe Pro Asp Ala Trp 
725 73 0 ?35 

Leu Met Thr Ser Lys Thr Glu Val Met Leu Leu Asn Pro Tyr Arg Val 
740 745 750 

Glu Glu Ala Leu Leu Phe Lys Arg Leu Leu Glu Asn His Lys Leu Pro 
755 760 765 

Ala Glu Pro Leu Glu Lys Pro He Met Leu Thr Glu Ser Leu Phe Asn 
770 775 7 80 

Gly Ser His Tyr Leu Asp Val Leu Tyr Lys Met Thr Ala Asp Asp Gin 
785 790 795 800 

Arg Tyr Ser Gly Ser Thr Tyr Leu Ser Asp Pro Arg Leu Thr Ala Asn 
805 810 815 

Gly Phe Lys He Lys Leu lie Pro Gly Val Ser He Thr Glu Asn Tyr 
820 8 25 830 

Leu Glu He Glu Gly Met Ala Asn Cys Leu Pro Phe Tyr Gly Val Ala 



835 



840 



845 



Asp Leu Lys Glu He Leu Asn Ala He Leu Asn Arg Asn Ala Lys Glu 
850 855 ago 

Val Tyr Glu Cys Arg Pro Arg Lys Val He Ser Tyr Leu Glu Gly Glu 



865 87 0 875 



880 



Ala Val Arg Leu Ser Arg Gin Leu Pro Met Tyr Leu Ser Lys Glu Asp 
885 890 895 

He Gin Asp He He Tyr Arg Met Lys His Gin Phe Gly Asn Glu He 
900 905 910 

Lys Glu Cys Val His Gly Arg Pro Phe Phe His His Leu Thr Tyr Leu 

17 
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PCT/US01/00934 

925 



Pro Glu Thr Thr 
930 



<210> 19 

<211> 3063 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ggcacgagtg gctgcttgcg gctagtggat 
ctgctctgtt aaaagcgaaa atgaaacaat 
gttctcagat catcacttcg gtggtcagtg 
atgctggtgc cacaagcgta gatgttaaac 
tgcgagataa cggggagggt atcaaggctg 
acacctcaaa aataaatagt catgaagatc 
gagaagcctt ggggtcaatt tgttgtatag 
ctgataattt tagcacccag tatgttttag 
cttcacatct tggtcaaggt acaactgtaa 
taagaaagca gttttactca actgcaaaaa 
atctcctcat gagctttggt atccttaaac 
aggcagttat ttggcagaaa agcagagtat 
tggggactgc tgttatgaac aatatggaat 
tttatctcag tggatttctt ccaaagtgtg 
caccagaaag aagtttcatc ttcataaaca 
agttaatccg acatcattac aatctgaaat 
ttttctttct gaaaatcgat gttcctacag 
aaagccaagt attattacaa aataaggaat 
cgacttgtta tggaccatta cctagtacaa 
ccgcagctga catcgttctt agtaaaacag 
aatcatctgg aaagaattat tcaaatgttg 
tgcataatga tgaatctgga aaaaacactg 
gtgactttgg ttatggtcat tgtagtagtg 
atgcatttca ggacatttca atgagtaatg 
gtaaaacttg ttttataagt tccgttaagc 
atatagatga gagtggggaa aatgaggaag 
ctgcagatga gtggagcagg ggaaatatac 
ctgtgaaaat tttagtgcct gaaaaaagtt 
caatccctga acaaatgaat cttaatgaag 
ataataaatc tggaaaagtt acagcttatg 
ccatgtcagc aagtgctctt tttgttcaag 
ctaagactag tttagaggat gcaacactac 
aagaggaaaa actgaaatat gaagagaagg 
aaatgaagag agccattgaa caggagtcac 
taaaacccac cagcgcatgg aatttggccc 
atcaaccaaa acttgatgaa ctccttcagt 

18 



ggtaattgcc tgcctcgcgc tagcagcaag 60 
tgcctgcggc aacagttcga ctcctttcaa 120 
ttgtaaaaga gcttattgaa aactccttgg 180 
tggagaacta tggatttgat aaaattgagg 240 
ttgatgcacc tgtaatggca atgaagtact 300 
ttgaaaattt gacaacttac ggttttcgtg 360 
ctgaggtttt aattacaaca agaacggctg 420 
atggcagtgg ccacatactt tctcagaaac 480 
ctgctttaag attatttaag aatctacctg 540 
aatgtaaaga tgaaataaaa aagatccaag 600 
ctgacttaag gattgtcttt gtacataaca 660 
cagatcacaa gatggctctc atgtcagttc 720 
cctttcagta ccactctgaa gaatctcaga 780 
atgcagacca ctctttcact agtctttcaa 840 
gtcgaccagt acatcaaaaa gatatcttaa 900 
gcctaaagga atctactcgt ttgtatcctg 960 
ctgatgttga tgtaaattta acaccagata 1020 
ctgttttaat tgctcttgaa aatctgatga 1080 
attcttatga aaataataaa acagatgttt 1140 
cagaaacaga tgtgcttttt aataaagtgg 1200 
atacttcagt cattccattc caaaatgata 1260 
atgattgttt aaatcaccag ataagtattg 1320 
aaatttctaa cattgataaa aacactaaga 1380 
tatcatggga gaactctcag acggaatata 1440 
acacccagtc agaaaatggc aataaagacc 1500 
aagcaggtct tgaaaactct tcggaaattt 1560 
ttaaaaattc agtgggagag aatattgaac 1620 
taccatgtaa agtaagtaat aataattatc 1680 
attcatgtaa caaaaaatca aatgtaatag 1740 
atttacttag caatcgagta atcaagaaac 1800 
atcatcgtcc tcagtttctc atagaaaatc 1860 
aaattgaaga actgtggaag acattgagtg 1920 
ctactaaaga cttggaacga tacaatagtc 1980 
aaatgtcact aaaagatggc agaaaaaaga 2040 
agaagcacaa gttaaaaacc tcattatcta 2100 
cccaaattga aaaaagaagg agtcaaaata 2160 
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ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa ^0 
acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc ag£"c"g -80 
atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat ajagta^ ~^To 
IZlTZt a lt taaaaga CttC ^ atcat — t tcctgcagag ccactjga ! ^ 
IIT* gttaaca ^ agtcttttta atggatctca ttatttagac gttttatata " 6 0 

aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta « 
cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa ^tactt^ 2580 

t ta a t a g 9 : t a a g t :itiT at tg r cccat tctatggagt agcagatt - 

tiaa^tt 3ttaaaCaga aat ^aaagg aagtttatga atgtagacct cgcaaagtga 700 
taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa -760 

EH Ft — »~ S 
i Er =" = 

ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060 



3063 



<210> 20 

<211> 934 

<212> PRT 

<213> Homo sapiens 



<400> 20 

Met Ala Val Gin Pro Lys Glu Thr Leu Qln Leu ^ ^ ^ ^ ^ 



!0 15 



Val sly Phe val Arg Phe Phe Gin Gly Met Pro Glu Lys Pro Thr Thr 
20 2 5 30 

Thr Val Arg Leu Phe Asp Arg Gly Asp Phe Tyr Thr Ala His Gly Glu 

40 45 

Asp Ala Leu Leu Ala Ala Arg Glu Val Phe Lys Thr Gin Gly Val lie 

55 60 

Lys Tyr Met Gly Pro Ala Gly. Ala Lys Asn Leu Gin Ser Val Val Leu 

70 7 c 

/b 80 
Ser Lys Met Asn Phe Glu Ser Phe Val Lys Asp Leu Leu Leu Val Arg 



95 



Gin Tyr Arg Val Glu Val Tyr Lys Asn Arg Ala Gly Asn 



100 



105 



Lys Ala Ser 
110 



LYS Glu Asn Asp Trp Tyr Leu Ala Tyr Lys Ala Ser Pro Gly Asn Leu 



19 
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Ser Gin Phe Glu Asp He Leu Phe Gly Asn Asn Asp Met Ser Ala Ser 
130 135 140 

He Gly Val Val Gly Val Lys Met Ser Ala Val Asp Gly Gin Arc Gin 
145 150 155 160 

Val Gly Val Gly Tyr Val Asp Ser He Gin Arg Lys Leu Gly Leu Cys 
165 170 175 

Glu Phe Pro Asp Asn Asp Gin Phe Ser Asn Leu Glu Ala Leu Leu He 
180 185 190 

Gin lie Gly Pro Lys Glu Cys Val Leu Pro Gly Gly Glu Thr Ala Gly 
195 200 205 

Asp Met Gly Lys Leu Arg Gin He He Gin Arg Gly Gly He Leu He 
210 215 220 

Thr Glu Arg Lys Lys Ala Asp Phe Ser Thr Lys Asp He Tyr Gin Asp 
225 230 235 240 

Leu Asn Arg Leu Leu Lys Gly Lys Lys Gly Glu Gin Met Asn Ser Ala 
245 250 255 

Val Leu Pro Glu Met Glu Asn Gin Val Ala Val Ser Ser Leu Ser Ala 
260 265 270 

Val He Lys Phe Leu Glu Leu Leu Ser Asp Asp Ser Asn Phe Gly Gin 
275 280 285 

Phe Glu Leu Thr Thr Phe Asp Phe Ser Gin Tyr Met Lys Leu Asp lie 
290 295 300 

Ala Ala Val Arg Ala Leu Asn Leu Phe Gin Gly Ser Val Glu Asp Thr 
305 310 315 320 

Thr Gly Ser Gin Ser Leu Ala Ala Leu Leu Asn Lys Cys Lys Thr Pro 
325 330 335 

Gin Gly Gin Arg Leu Val Asn Gin Trp He Lys Gin Pro Leu Met Asp 
340 345 350 

Lys Asn Arg lie Glu Glu Arg Leu Asn Leu Val Glu Ala Phe Val Glu 
355 360 365 

Asp Ala Glu Leu Arg Gin Thr Leu Gin Glu Asp Leu Leu Arg Arg Phe 
370 375 380 
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Pro Asp Leu Asn Arg Ala Lys a ^ 

• DOD 390 



395 



400 



Leu Gin Asp Cys Tyr Arg Leu Tyr Gin Gly He Asn Gin Leu Pro Asn 
405 415 

Val He Gin Ala Leu Glu Lys His Glu Gly Lys His Gin Lys Leu Leu 
A "° ^5 43Q 

Leu Ala Val Phe Val Thr Pro Leu Thr Asp Leu Arg Ser Asp Phe Ser 



440 



445 



Lys Phe Gin Glu Met He Glu Thr Thr Leu Asp Met Asp Gin Val Glu 



455 



460 



Asn His Glu Phe Leu Val Lys Pro Ser Phe Asp Pro Asn Leu Ser Glu 

HOD 470 



475 



480 



Leu Arg Glu lie Met Asn Asp Leu Glu Lys Lys Met Gin Ser Thr Leu 
485 495 

He Ser Ala Ala Arg Asp Leu Gly Leu Asp Pro Gly Lys Gin He Lys 
500 505 510 

Leu Asp Ser Ser Ala Gin Phe Gly Tyr Tyr Phe Arg Val Thr Cys Lys 

520 525 

Glu Glu Lys val Leu Arg Asn Asn Lys Asn Phe Ser Thr Val Asp He 

535 

Gin Lys Asn Gly Val Lys Phe Thr Asn Ser Lys Leu Thr Ser Leu Asn 

550 5 " 560 

Glu Glu Tyr Thr Lys Asn Lys Thr Glu Tyr Glu Glu Ala Gin Asp Ala 

5 65 



570 



575 



He Val Lys Glu He Val Asn He Ser Ser Gly Tyr Val Glu Pro 



580 



585 



Met 



590 



Gin Thr Leu Asn Asp Val Leu Ala Gin Leu Asp Ala Val Val Ser Phe 
595 600 605 

Ala His Val Ser Asn Gly Ala Pro Val Pro Tyr Val Arg Pro Ala He 

615 620 

Leu Glu Lys Gly Gin Gly Arg lie He Leu Lys Ala Ser Arg His Ala 



630 



635 



640 
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Cys Val Glu Val Gin Asp Glu lie Ala Phe lie Pro Asn Asp Val Tyr 
645 650 655 

Phe Glu Lys Asp Lys Gin Met Phe His lie lie Thr Gly Pro Asn Met 
660 665 670 

Gly Gly Lys Ser Thr Tyr lie Arg Gin Thr Gly Val lie Val Leu Met 
675 680 685 

Ala Gin lie Gly Cys Phe Val Pro Cys Glu Ser Ala Glu Val Ser lie 
690 695 700 

Val Asp Cys lie Leu Ala Arg Val Gly Ala Gly Asp Ser Gin Leu Lys 
705 710 715 720 

Gly Val Ser Thr Phe Met Ala Glu Met Leu Glu Thr Ala Ser lie Leu 
725 730 735 

Arg Ser Ala Thr Lys Asp Ser Leu lie lie lie Asp Glu Leu Gly Arg 
740 745 750 

Gly Thr Ser Thr Tyr Asp Gly Phe Gly Leu Ala Trp Ala lie Ser Glu 
755 760 765 

Tyr lie Ala Thr Lys lie Gly Ala Phe Cys Met Phe Ala Thr His Phe 
770 775 780 

His Glu Leu Thr Ala Leu Ala Asn Gin lie Pro Thr Val Asn Asn Leu 
785 790 795 800 

His Val Thr Ala Leu Thr Thr Glu Glu Thr Leu Thr Met Leu Tyr Gin 
805 810 815 

Val Lys Lys Gly Val Cys Asp Gin Ser Phe Gly lie His Val Ala Glu 
820 825 830 

Leu Ala Asn Phe Pro Lys His Val lie Glu Cys Ala Lys Gin Lys Ala 
835 840 845 

Leu Glu Leu Glu Glu Phe Gin Tyr lie Gly Glu Ser Gin Gly Tyr Asp 
850 855 860 

lie Met Glu Pro Ala Ala Lys Lys Cys Tyr Leu Glu Arg Glu Gin Gly 
865 870 875 880 

Glu Lys lie lie Gin Glu Phe Leu Ser Lys Val Lys Gin Met Pro Phe 
885 890 895 



22 
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Thr Glu Met Ser Glu Glu 
900 

Ala Glu Val lie Ala Lys 
915 

Arg He Lys Val Thr Thr 
930 



Asn He Thr He Lys Leu 
905 

Asn Asn Ser Phe Val Asn 
920 



PCT/US0 1/00934 

Lys Gin Leu Lys 
910 

Glu He He Ser 
925 



<210> 21 
<211> 3145 
<212> DNA 

<213> Homo sapiens 
<400> 21 



ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttn,* 

gtttcgacat ggcggtgcag ccgaaggaga cgctgclatt aaTa " 60 
gcttcgtgcg cttctttcaa aacat^L C9 ° tgCagtt ^agagcgcg gccgaggtcg 120 
accggggcgl cttctataca * ** caccacagtg cgccttttcg 180 

tcaa a eL ggggg t !^ lZt ac T g aggaCgCgCt *ctggccgcc cgggaggtgt 240 
ttgtgcttag ta^»t "SScS ctgcagagtg 300 

atagagttga agtttaLaq !l! T ttgta3aaga ^tcttctg gttcgtcagt 360 
atttggcata tl gg Tt c l ^«Sc ft^ ^ aatgattggt 420 

acaatgatat gtcagcttcc attaatatta ^^f^ ^.gacatt ctctttggta 480 
agagacaggt tggagttggg tatSggatt clT " ******** ^tgatggcc 540 

tccctgataa tgatcagJtc ^ccStctta ^J"™ tagga ctgtgtgaat 600 

aatgtgtttt acccgga^ gaga^cta oil * CatCCagatt ^ccaaagg 660 
aaagaggagg aattctqftc acf™ ^agacatggg gaaactgaga cagataattc 720 

atclggacct caiccggttg ttolT^ a3aaagCtga «tftccaca aaagacattt 780 
tgccagaaat ggaga^ J^" l^T^ ^ 

aactcttatc agatgattcc aacttt!^ a ^ aCtgtC ^Staatc aagtttttag 900 
agtatatgaa attggatatt gcagcalL a! ^ 9aCtaCtttt S«cttcgcc 960 
aagataccac tggctctcao gagcccttaa cctttttcag ggttctgttg 1020 

gacaaagact tgttaac Ca | tgLtSac ^T'*" ta9gtgtaaa "ccctcaag 1080 
agagattgaa tttagtggal gctt"rtla """f^ agaatagagg 1140 

aagatttact tcgtcg"" cclaatctt, actttacaag 1200 

cagcaaactt acLgattgt tac^ctct ?° C3agaagttt — gacaag 1260 

tacaggctct ggaa aaca f at -gggtat aaatcaacta cctaatgtta 1320 

ctcctcttac tgatcttcgt — gaaatt attgttggca gtttttgtga 138 0 

tagatatgga tcaggtggla £ tcct! "'"'^ ^aacaactt !440 

tcagtgaatt aagagaaata t^" 9 ^ 3 acctt -ttt gatcctaatc 1500 

gtgcagccag agatcttggc ttggacccto '" aaaa9aa ^gcagtca acattaataa 1560 
agtttggata ttactttcgt g tlT cct Tt a ITaT^ ta " Ct « at tccagtgcac 1620 
actttagtac tgtagatatc llToT^ 1680 

ctttaaatga agagtatacc aaL^ ^gttaaatt taccaacagc aaattgactt 1740 
ttaaagaaat tgtcaata"tt tcttc^ ^^c^ ^gccattg 1800 

tgttagctca gctagatgct ^tcaact f^ 93 *^ Mt ^« ctcaatgatg I860 
get gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 
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catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 

ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 

aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100 

atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160 

agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 

aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 

ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 

atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 

gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 24 60 

ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 

agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 

agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640 

gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 

agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 27 60 

aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 

agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880 

cagtaatgga- atgaaggtaa tattgataag ctattgtctg taatagtttt atattgtttt 2940 

atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000 

atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060 

gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120 

ataaataaaa tcatgtagtt tgtgg 3145 



<210> 22 
<211> 756 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ser Phe Val Ala Gly Val lie Arg Arg Leu Asp Glu Thr Val Val 
15 10 15 

Asn Arg lie Ala Ala Gly Glu Val lie Gin Arg Pro Ala Asn Ala lie 

20 25 30 

Lys Glu Met lie Glu Asn Cys Leu Asp Ala Lys Ser Thr Ser lie Gin 
35 40 45 

Val lie Val Lys Glu Gly Gly Leu Lys Leu lie Gin lie Gin Asp Asn 
50 55 60 

Gly Thr Gly lie Arg Lys Glu Asp Leu Asp lie Val Cys Glu Arg Phe 
65 70 75 80 

Thr Thr Ser Lys Leu Gin Ser Phe Glu Asp Leu Ala Ser lie Ser Thr 
85 90 95 

Tyr Gly Phe Arg Gly Glu Ala Leu Ala Ser lie Ser His Val Ala His 
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100 



105 



110 



Val Thr lie Thr Thr Lys Thr Ala Asp Gly Lys Cys Ala Tyr ^ 

125 

Ser Tyr Ser Asp Gly Lys Leu Lys Ala Pro Pro Lys Pro Cys ^ Gly 



Asn Gin Gly Thr cm Ile Thr Val ^ Asp ^ ^ ^ ^ ^ ^ 

150 



155 



160 



Thr Arg Arg Lys Ala Leu Lys Asn Pro Ser Glu Glu Tyr Gly Lys He 
165 17 ° 175 

Leu Glu Val Val Gly Arg Tyr Ser Val His Asn Ala Gly lie Ser Phe 
180 185 190 

Ser Val Lys Lys Gin Gly Glu Thr Val Ala Asp Val Arg Thr Leu Pro 
195 200 205 

Asn Ala Ser Thr Val Asp Asn lie Arg Ser lie Phe Gly Asn Ala Val 

215 220 

Ser Arg Glu Leu He Glu He Gly Cys Glu Asp Lys Thr Leu Ala Phe 

230 235 24Q 

Lys Met Asn Gly Tyr lie Ser Asn Ala Asn Tyr Ser Val Lys Lys Cys 

245 - - 



250 



255 



He Phe Leu Leu Phe He Asn His Arg Leu Val Glu Ser Thr Ser Leu 

<t bU 



265 



270 



Arg Lys Ala Ile Glu Thr Val Tyr Ala Ala Tyr Leu 



275 



280 



Pro Lys Asn Thr 
285 

His Pro Phe Leu Tyr Leu Ser Leu Glu lie Ser Pro Gin Asn Val Asp 

295 300 

Val Asn Val His Pro Thr Lys His Glu Val His Phe Leu His Glu Glu 

310 31 5 320 

Ser lie Leu Glu Arg Val Gin Gin His lie Glu Ser Lys Leu Leu Gly 
325 3 30 335 

Ser Asn Ser Ser Arg Met Tyr Phe Thr Gin Thr Leu Leu Pro Gly Leu 
340 3« 350 

Ala Gly Pro Ser Gly Glu Met Val Lys Ser Thr Thr Ser Leu Thr Ser 
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355 360 365 

Ser Ser Thr Ser Gly Ser Ser Asp Lys Val Tyr Ala His Gin Met Val 
370 375 380 

Arg Thr Asp Ser Arg Glu Gin Lys Leu Asp Ala Phe Leu Gin Pro Leu 
385 390 395 400 

Ser Lys Pro Leu Ser Ser Gin Pro Gin Ala lie Val Thr Glu Asp Lys 
405 410 415 

Thr Asp lie Ser Ser Gly Arg Ala Arg Gin Gin Asp Glu Glu Met Leu 
420 425 430 

Glu Leu Pro Ala Pro Ala Glu Val Ala Ala Lys Asn Gin Ser Leu Glu 
435 440 445 

Gly Asp Thr Thr Lys Gly Thr Ser Glu Met Ser Glu Lys Arg Gly Pro 
450 455 460 

Thr Ser Ser Asn Pro Arg Lys Arg His Arg Glu Asp Ser Asp Val Glu 
465 470 475 480 

Met Val Glu Asp Asp Ser Arg Lys Glu Met Thr Ala Ala Cys Thr Pro 
485 490 495 

Axg Arg Arg lie lie Asn Leu Thr Ser Val Leu Ser Leu Gin Glu Glu 
500 505 510 

lie Asn Glu Gin Gly His Glu Val Leu Arg Glu Met Leu His Asn His 
515 520 525 

Ser Phe Val Gly Cys Val Asn Pro Gin Trp Ala Leu Ala Gin His Gin 
530 535 540 

Thr Lys Leu Tyr Leu Leu Asn Thr Thr Lys Leu Ser Glu Glu Leu Phe 
545 550 555 560 

Tyr Gin lie Leu lie Tyr Asp Phe Ala Asn Phe Gly Val Leu Arg Leu 
565 570 575 

Ser Glu Pro Ala Pro Leu Phe Asp Leu Ala Met Leu Ala Leu Asp Ser 
580 585 590 

Pro Glu Ser Gly Trp Thr Glu Glu Asp Gly Pro Lys Glu Gly Leu Ala 
595 600 605 

Glu Tyr lie Val Glu Phe Leu Lys Lys Lys Ala Glu Met Leu Ala Asp 
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620 
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Tyr Phe Ser Leu 
625 

Leu Leu lie Asp 

lie Leu Arg Leu 
660 

Phe Glu Ser Leu 
675 

Gin Tyr lie Ser 
690 

Pro Gly Ser lie 
705 

Tyr Lys Ala Leu 



Asp Gly Asn He 
740 

Phe Glu Arg Cys 
755 



Glu He Asp Glu 
630 

Asn Tyr Val Pro 
645 

Ala Thr Glu Val 



Ser Lys Glu Cys 
680 

Glu Glu Ser Thr 
695 

Pro Asn Ser Trp 
710 

Arg Ser His He 
725 

Leu Gin Leu Ala 



Glu Gly Asn Leu 
635 

Pro Leu Glu Gly 
650 

Asn Trp Asp Glu 
665 

Ala Met Phe Tyr 



Leu Ser Gly Gin 
700 

Lys Trp Thr Val 
715 

Leu Pro Pro Lys 
730 

Asn Leu Pro Asp 
745 



He Gly Leu Pro 
640 

Leu Pro He Phe 
655 

Glu Lys Glu Cys 
670 

Ser He Arg Lys 
685 

Gin Ser Glu Val 



Glu His He Val 
720 

His Phe Thr Glu 
735 

Leu Tyr Lys Val 
750 



<210> 23 

<211> 2484 

<212> DNA 

<213> Homo sapiens 



<400> 23 

cttggctctt ctggcgccaa aatgtcgttc 
acagtggtga accgcatcgc ggcgggggaa 
gagatgattg agaactgttt agatgcaaaa 
ggaggcctga agttgattca gatccaagac 
gatattgtat gtgaaaggtt cactactagt 
atttctacct atggctttcg aggtgaggct 
actattacaa cgaaaacagc tgatggaaag 
aaactgaaag cccctcctaa accatgtgct 
gacctttttt acaacatagc cacgaggaga 
gggaaaattt tggaagttgt tggcaggtat 
gttaaaaaac aaggagagac agtagctgat 
gacaatattc gctccatctt tggaaatgct 

27 



gtggcagggg ttattcggcg gctggacgag 60 
gttatccagc ggccagctaa tgctatcaaa 120 
tccacaagta ttcaagtgat tgttaaagag 180 
aatggcaccg ggatcaggaa agaagatctg 240 
aaactgcagt cctttgagga tttagccagt 300 
ttggccagca taagccatgt ggctcatgtt 360 
tgtgcataca gagcaagtta ctcagatgga 420 
ggcaatcaag ggacccagat cacggtggag 480 
aaagctttaa aaaatccaag tgaagaatat 54 0 
tcagtacaca atgcaggcat tagtttctca 600 
gttaggacac tacccaatgc ctcaaccgtg 660 
gttagtcgag aactgataga aattggatgt 720 
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gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 7 80 
aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 
ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 114 0 
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 
ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 14 4 0 
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500 
tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 
aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620 
gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680 
aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 17 40 
ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800 
gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860 
tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920 
gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980 
cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 204 0 
gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100 
gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160 
tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220 
ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280 
gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340 
cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 24 00 
cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 24 60 
aataaataga tgtgtcttaa cata 24 84 



<210> 24 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

He He Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 

Phe Asp Lys lie Glu Val Arg Asp Asn Gly Glu Gly lie Lys Ala Val 

28 
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50 55 



60 



Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys He Asn Ser 
65 70 75 80 

His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Al 



85 90 



a 

95 



Leu Gly Ser He Cys Cys He Ala Glu Val Leu He Thr Thr Arg Thr 
100 105 110 

Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 

He Leu Ser Gin Lys 
130 



<210> 25 
<211> 426 
<212> DNA 

<213> Homo sapiens 
<400> 25 

cgaggcggat cgggtgttgc atccatggag 
aaggccatca aacctattga tcggaagtca 
ctgagtctaa gcactgcggt aaaggagtta 
aatattgatc taaagcttaa ggactatgga 
tgtggggtag aagaagaaaa cttcgaaggc 
caagagtttg ccgacctaac tcaggttgaa 
tcactttgtg cactgagcga tgtcaccatt 
acttga 



cgagctgaga gctcgagtac agaacctgct 60 
gtccatcaga tttgctctgg gcaggtggta 120 
gtagaaaaca gtctggatgc tggtgccact 180 
gtggatctta ttgaagtttc agacaatgga 240 
ttaactctga aacatcacac atctaagatt 300 
acttttggct ttcgggggga agctctgagc 360 
tctacctgcc acgcatcggc gaaggttgga 420 

426 



<210> 26 
<211> 1360 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Met Ser Arg Gin Ser Thr Leu Tyr 
1 5 

Leu Ser Asp Ala Asn Lys Ala Ser 
20 

Arg Ala Ala Ala Ala Pro Gly Ala 

35 40 

29 



Ser Phe Phe Pro Lys Ser Pro Ala 
10 15 

Ala Arg Ala Ser Arg Glu Gly Gly 
25 30 

Ser Pro Ser Pro Gly Gly Asp Ala 
45 
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Ala Trp Ser Glu Ala Gly Pro Gly Pro Arg Pro Leu Ala Arg Ser Ala 
50 55 60 

Ser Pro Pro Lys Ala Lys Asn Leu Asn Gly Gly Leu Arg Arg Ser Val 
65 70 75 80 

Ala Pro Ala Ala Pro Thr Ser Cys Asp Ph.- Ser Pro Gly Asp Leu Val 
85 90 95 

Trp Ala Lys Met Glu Gly Tyr Pro Trp Trp Pro Cys Leu Val Tyr Asn 
100 105 110 

His Pro Phe Asp Gly Thr Phe lie Arg Glu Lys Gly Lys Ser Val Arg 
115 120 125 

Val His Val Gin Phe Phe Asp Asp Ser Pro Thr Arg Gly Trp Val Ser 
130 135 140 

Lys Arg Leu Leu Lys Pro Tyr Thr Gly Ser Lys Ser Lys Glu Ala Gin 
145 150 155 160 

Lys Gly Gly His Phe Tyr Ser Ala Lys Pro Glu lie Leu Arg Ala Met 
165 170 175 

Gin Arg Ala Asp Glu Ala Leu Asn Lys Asp Lys lie Lys Arg Leu Glu 
180 185 190 

Leu Ala Val Cys Asp Glu Pro Ser Glu Pro Glu Glu Glu Glu Glu Met 
195 200 205 

Glu Val Gly Thr Thr Tyr Val Thr Asp Lys Ser Glu Glu Asp Asn Glu 
210 215 220 

lie Glu Ser Glu Glu Glu Val Gin Pro Lys Thr Gin Gly Ser Arg Arg 
225 230 235 240 

Ser Ser Arg Gin lie Lys Lys Arg Arg Val lie Ser Asp Ser Glu Ser 
245 250 255 

Asp lie Gly Gly Ser Asp Val Glu Phe Lys Pro Asp Thr Lys Glu Glu 
260 265 270 

Gly Ser Ser Asp Glu lie Ser Ser Gly Val Gly Asp Ser Glu Ser Glu 
275 280 285 

Gly Leu Asn Ser Pro Val Lys Val Ala Arg Lys Arg Lys Arg Met Val 
290 295 300 

30 



BNSDOCIO: <WO 0205485SA1_I_> 



VV ° 02/054856 PCT/US01/00934 

Thr Gly Asn Gly Ser Leu Lys Arg Lys Ser Ser Arg Lys Glu Thr Pro 
305 310 315 320 

Ser Ala Thr Lys Gin Ala Thr Ser He Ser Ser Glu Thr Lys Asn Thr 
325 330 335 

Leu Arc* Ala Phe' Ser Ala Pro Gin Asn Ser Glu Ser Gin Ala His Val 
340 345 350 

Ser Gly Gly Gly Asp Asp Ser Ser Arg Pro Thr Val Trp Tyr His Glu 
355 360 365 

Thr Leu Glu Trp Leu Lys Glu Glu Lys Arg Arg Asp Glu His Arg Arg 
370 375 380 

Arg Pro Asp His Pro Asp Phe Asp Ala Ser Thr Leu Tyr Val Pro Glu 
385 390 395 400 

Asp Phe Leu Asn Ser Cys Thr Pro Gly Met Arg Lys Trp Trp . Gin He 
405 410 415 

Lys Ser Gin Asn Phe Asp Leu Val He Cys Tyr Lys Val Gly Lys Phe 
420 425 430 

Tyr Glu Leu Tyr His Met Asp Ala Leu He Gly Val Ser Glu Leu Gly 
435 440 445 

Leu Val Phe Met Lys Gly Asn Trp Ala His Ser Gly Phe Pro Glu He 
450 455 460 

Ala Phe Gly Arg Tyr Ser Asp Ser Leu Val Gin Lys Gly Tyr Lys Val 
465 470 475 480 

Ala Arg Val Glu Gin Thr Glu Thr Pro Glu Met Met Glu Ala Arg Cys 
485 490 495 

Arg Lys Met Ala His He Ser Lys Tyr Asp Arg Val Val Arg Arg Glu 
500 505 510 

He Cys Arg He He Thr Lys Gly Thr Gin Thr Tyr Ser Val Leu Glu 
515 520 525 

Gly Asp Pro Ser Glu Asn Tyr Ser Lys Tyr Leu Leu Ser Leu Lys Glu 
530 535 540 

Lys Glu Glu Asp Ser Ser Gly His Thr Arg Ala Tyr Gly Val Cys Phe 
545 550 555 560 
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Val Asp Thr Ser Leu Gly Lys Phe Phe lie Gly Gin Phe Ser Asp Asp 
565 570 575 

Arg His Cys Ser Arg Phe Arg Thr Leu Val Ala His Tyr Pro Pro Val 
580 585 590 

Gin Val Leu Phe Glu Lys Gly Asn Leu Ser Lys Glu Thr Lys Thr lie 
595 600 605 

Leu Lys Ser Ser Leu Ser Cys Ser Leu Gin Glu Gly Leu lie Pro Gly 
610 615 620 

Ser Gin Phe Trp Asp Ala Ser Lys Thr Leu Arg Thr Leu Leu Glu Glu 
625 630 635 640 

Glu Tyr Phe Arg Glu Lys Leu Ser Asp Gly lie Gly Val Met Leu Pro 
645 650 655 

Gin Val Leu Lys Gly Met Thr Ser Glu Ser Asp Ser lie Gly Leu Thr 
660 665 670 

Pro Gly Glu Lys Ser Glu Leu Ala Leu Ser Ala Leu Gly Gly Cys Val 
675 680 685 

Phe Tyr Leu Lys Lys Cys Leu lie Asp Gin Glu Leu Leu Ser Met Ala 
690 695 700 

Asn Phe Glu Glu Tyr lie Pro Leu Asp Ser Asp Thr Val Ser Thr Thr 
705 710 715 720 

Arg Ser Gly Ala lie Phe Thr Lys Ala Tyr Gin Arg Met Val Leu Asp 
725 730 735 

Ala Val Thr Leu Asn Asn Leu Glu lie Phe Leu Asn Gly Thr Asn Gly 
740 745 750 

Ser Thr Glu Gly Thr Leu Leu Glu Arg Val Asp Thr Cys His Thr Pro 
755 760 765 

Phe Gly Lys Arg Leu Leu Lys Gin Trp Leu Cys Ala Pro Leu Cys Asn 
770 775 780 

His Tyr Ala lie Asn Asp Arg Leu Asp Ala lie Glu Asp Leu Met Val 
785 790 795 800 

Val Pro Asp Lys lie Ser Glu Val Val Glu Leu Leu Lys Lys Leu Pro 
805 810 815 
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Asp Leu Glu Arg Leu Leu Ser Lys He His Asn Val Gly Ser Pro Leu 
820 8 25 830 

Lys Ser Gin Asn His Pro Asp Ser Arg Ala He Met Tyr Glu Glu Thr 
835 840 845 

Thr Tyr Ser Lys Lys Lys He He Asp Phe Leu Ser Ala Leu Glu Gly 
850 855 360 

Phe Lys Val Met Cys Lys He He Gly He Met Glu Glu Val Ala Asp 
865 87 ° 875 880 

Gly Phe Lys Ser Lys He Leu Lys Gin Val He Ser Leu Gin Thr Lys 
885 890 895 

Asn Pro Glu Gly Arg Phe Pro Asp Leu Thr Val Glu Leu Asn Arg Trp 
900 905 910 

Asp Thr Ala Phe Asp His Glu Lys Ala Arg Lys Thr Gly Leu He Thr 
915 920 905 

Pro Lys Ala Gly Phe Asp Ser Asp Tyr Asp Gin Ala Leu Ala Asp He 
930 935 94Q 

Arg Glu Asn Glu Gin Ser Leu Leu Glu Tyr Leu Glu Lys Gin Arg Asn 
945 95 ° 955 960 

Arg He Gly Cys Arg Thr He Val Tyr Trp Gly He Gly Arg Asn Arg 
965 970 975 

Tyr Gin Leu Glu He Pro Glu Asn Phe Thr Thr Arg Asn Leu Pro Glu 
980 985 990 

Glu Tyr Glu Leu Lys Ser Thr Lys Lys Gly Cys Lys Arg Tyr Trp Thr 
995 1000 10 05 

Lys Thr He Glu Lys Lys Leu Ala Asn Leu He Asn Ala Glu Glu Arg 
1010 1015 1020 

Arg Asp Val Ser Leu Lys Asp Cys Met Arg Arg Leu Phe Tyr Asn Phe 
1025 1030 1035 1040 

Asp Lys Asn Tyr Lys Asp Trp Gin Ser Ala Val Glu Cys He Ala Val 
1045 1050 1055 

Leu Asp Val Leu Leu Cys Leu Ala Asn Tyr Ser Arg Gly Gly Asp Gly 
1060 1065 1070 
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Pro Met Cys Arg Pro Val lie Leu Leu Pro Glu Asp Thr Pro Pro Phe 
1075 1030 1085 

Leu Glu Leu Lys Gly Ser Arg His Pro Cys lie Thr Lys Thr Phe Phe 
1090 1095 1100 

Gly Asp Asp Phe lie Pro Asn Asp lie Leu lie Gly Cys Glu Glu Glu 
1105 1110 1115 1120 

Glu Gin Glu Asn Gly Lys Ala Tyr Cys Val Leu Val Thr Gly Pro Asn 
1125 1130 1135 

Met Gly Gly Lys Ser Thr Leu Met Arg Gin Ala Gly Leu Leu Ala Val 
1140 1145 1150 

Met Ala Gin Met Gly Cys Tyr Val Pro Ala Glu Val Cys Arg Leu Thr 
1155 1160 1165 

Pro lie Asp Arg Val Phe Thr Arg Leu Gly Ala Ser Asp Arg lie Met 
1170 1175 1180 

Ser Gly Glu Ser Thr Phe Phe Val Glu Leu Ser Glu Thr Ala Ser lie 
1185 1190 1195 1200 

Leu Met His Ala Thr Ala His Ser Leu Val Leu Val Asp Glu Leu Gly 
1205 1210 1215 

Arg Gly Thr Ala Thr Phe Asp Gly Thr Ala lie Ala Asn Ala Val Val 
1220 1225 1230 

Lys Glu Leu Ala Glu Thr lie Lys Cys Arg Thr Leu Phe Ser Thr His 
1235 1240 1245 

Tyr His Ser Leu Val Glu Asp Tyr Ser Gin Asn Val Ala Val Arg Leu 
1250 1255 1260 

Gly His Met Ala Cys Met Val Glu Asn Glu Cys Glu Asp Pro Ser Gin 
1265 1270 1275 1280 

Glu Thr lie Thr Phe .Leu Tyr Lys Phe lie Lys Gly Ala Cys Pro Lys 
1285 1290 1295 

Ser Tyr Gly Phe Asn Ala Ala Arg Leu Ala Asn Leu Pro Glu Glu Val 
1300 1305 1310 

lie Gin Lys Gly His Arg Lys Ala Arg Glu Phe Glu Lys Met Asn Gin 
1315 1320 1325 
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Ser Leu Arg Leu Phe Arg Glu Val 
1330 1335 

Val Asp Ala Glu Ala Val His Lys 
1345 1350 



Cys Leu Ala Ser Glu Arg Ser Thr 
1340 

Leu Leu Thr Leu lie Lys Glu Leu 
1355 1360 



<210> 27 

<211> 4244 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gccgcgcggt agatgcggtg cttttaggag 
ggctgtcggt atgtcgcgac agagcaccct 
gagtgatgcc aacaaggcct cggccagggc 
ccccggggcc tctccttccc caggcgggga 
caggcccttg gcgcgctccg cgtcaccgcc 
gagatcggta gcgcctgctg cccccaccag 
ggccaagatg gagggttacc cctggtggcc 
aacattcatc cgcgagaaag ggaaatcagt 
cccaacaagg ggctgggtta gcaaaaggct 
ggaagcccag aagggaggtc atttttacag 
acgtgcagat gaagccttaa ataaagacaa 
tgagccctca gagccagaag aggaagaaga 
taagagtgaa gaagataatg aaattgagag 
atctaggcga agtagccgcc aaataaaaaa 
cattggtggc tctgatgtgg aatttaagcc 
aataagcagt ggagtggggg atagtgagag 
tcgaaagcgg aagagaatgg tgactggaaa 
ggaaacgccc tcagccacca aacaagcaac 
gagagctttc tctgcccctc aaaattctga 
tgacagtagt cgccctactg tttggtatca 
gagaagagat gagcacagga ggaggcctga 
tgtgcctgag gatttcctca attcttgtac 
gtctcagaac tttgatcttg tcatctgtta 
catggatgct cttattggag tcagtgaact 
ccattctggc tttcctgaaa ttgcatttgg 
ctataaagta gcacgagtgg aacagactga 
aaagatggca catatatcca agtatgatag 
taccaagggt acacagactt acagtgtgct 
gtatcttctt agcctcaaag aaaaagagga 
tgtgtgcttt gttgatactt cactgggaaa 
ccattgttcg agatttagga ctctagtggc 

35 



ctccgtccga cagaacggtt gggccttgcc 60 
gtacagcttc ttccccaagt ctccggcgct 120 
ctcacgcgaa ggcggccgtg ccgccgctgc 180 
tgcggcctgg agcgaggctg ggcctgggcc 240 
caaggcgaag aacctcaacg gagggctgcg 300 
ttgtgacttc tcaccaggag atttggtttg 360 
ttgtctggtt tacaaccacc cctttgatgg 420 
ccgtgttcat gtacagtttt ttgatgacag 480 
tttaaagcca tatacaggtt caaaatcaaa 540 
tgcaaagcct gaaatactga gagcaatgca 600 
gattaagagg cttgaattgg cagtttgtga 660 
gatggaggta ggcacaactt acgtaacaga 720 
tgaagaggaa gtacagccta agacacaagg 780 
acgaagggtc atatcagatt ctgagagtga 840 
agacactaag gaggaaggaa gcagtgatga 900 
tgaaggcctg aacagccctg tcaaagttgc 960 
tggctctctt aaaaggaaaa gctctaggaa 1020 
tagcatttca tcagaaacca agaatacttt 1080 
atcccaagcc cacgttagtg gaggtggtga 114 0 
tgaaacttta gaatggctta aggaggaaaa 1200 
tcaccccgat tttgatgcat ctacactcta 1260 
tcctgggatg aggaagtggt ggcagattaa 1320 
caaggtgggg aaattttatg agctgtacca 1380 
ggggctggta ttcatgaaag gcaactgggc 14 40 
ccgttattca gattccctgg tgcagaaggg 1500 
gactccagaa atgatggagg cacgatgtag 1560 
agtggtgagg agggagatct gtaggatcat 1620 
ggaaggtgat ccctctgaga actacagtaa 1680 
agattcttct ggccatactc gtgcatatgg 1740 
gtttttcata ggtcagtttt cagatgatcg 1800 
acactatccc ccagtacaag ttttatttga 1860 
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aaaaggaaat ctctcaaagg aaactaaaac aattctaaag agttcattgt cctgttctct 1920 
tcaggaaggt ctgatacccg gctcccagtt ttgggatgca tccaaaactt tgagaactct 1980 
ccttgaggaa gaatatttta gggaaaagct aagtgatggc attggggtga tgttacccca 2040 
ggtgcttaaa ggtatgactt cagagtctga ttccattggg ttgacaccag gagagaaaag 2100 
tgaattggcc ctctctgctc taggtggttg tgtcttctac ctcaaaaaat gccttattga 2160 
tcaggagctt ttatcaatgg ctaattttga agaatatatt cccttggatt ctgacacagt 2220 
cagcactaca agatctggtg ctatcttcac caaagcctat caacgaatgg tgctagatgc 228 0 
agtgacatta aacaacttgg agatttttct gaatggaaca aatggttcta ctgaaggaac 2340 
cctactagag agggttgata cttgccatac tccttttggt aagcggctcc taaagcaatg 2400 
gctttgtgcc ccactctgta accattatgc tattaatgat cgtctagatg ccatagaaga 24 60 
cctcatggtt gtgcctgaca aaatctccga agttgtagag cttctaaaga agcttccaga 2520 
tcttgagagg ctactcagta aaattcataa tgttgggtct cccctgaaga gtcagaacca 25S0 
cccagacagc agggctataa tgtatgaaga aactacatac agcaagaaga agattattga 2640 
ttttctttct gctctggaag gattcaaagt aatgtgtaaa attataggga tcatggaaga 2700 
agttgctgat ggttttaagt ctaaaatcct taagcaggtc atctctctgc agacaaaaaa 2760 
tcctgaaggt cgttttcctg atttgactgt agaattgaac cgatgggata cagcctttga 2820 
ccatgaaaag gctcgaaaga ctggacttat tactcccaaa gcaggctttg actctgatta 2880 
tgaccaagct cttgctgaca taagagaaaa tgaacagagc ctcctggaat acctagagaa 2940 
acagcgcaac agaattggct gtaggaccat agtctattgg gggattggta ggaaccgtta 3000 
ccagctggaa attcctgaga atttcaccac tcgcaatttg ccagaagaat acgagttgaa 3060 
atctaccaag aagggctgta aacgatactg gaccaaaact attgaaaaga agttggctaa 3120 
tctcataaat gctgaagaac ggagggatgt atcattgaag gactgcatgc ggcgactgtt 3180 
ctataacttt gataaaaatt acaaggactg gcagtctgct gtagagtgta tcgcagtgtt 3240 
ggatgtttta ctgtgcctgg ctaactatag tcgagggggt gatggtccta tgtgtcgccc 3300 
agtaattctg ttgccggaag ataccccccc cttcttagag cttaaaggat cacgccatcc 3360 
ttgcattacg aagacttttt ttggagatga ttttattcct aatgacattc taataggctg 3420 
tgaggaagag gagcaggaaa atggcaaagc ctattgtgtg cttgttactg gaccaaatat 3480 
ggggggcaag tctacgctta tgagacaggc tggcttatta gctgtaatgg cccagatggg 354 0 
ttgttacgtc cctgctgaag tgtgcaggct cacaccaatt gatagagtgt ttactagact ' 3600 
tggtgcctca gacagaataa tgtcaggtga aagtacattt tttgttgaat taagtgaaac 3660 
tgccagcata ctcatgcatg caacagcaca ttctctggtg cttgtggatg aattaggaag 3720 
aggtactgca acatttgatg ggacggcaat agcaaatgca gttgttaaag aacttgctga 3780 
gactataaaa tgtcgtacat tattttcaac tcactaccat tcattagtag aagattattc 3840 
tcaaaatgtt gctgtgcgcc taggacatat ggcatgcatg gtagaaaatg aatgtgaaga 3900 
ccccagccag gagactatta cgttcctcta taaattcatt aagggagctt gtcctaaaag 3960 
ctatggcttt aatgcagcaa ggcttgctaa tctcccagag gaagttattc aaaagggaca 4020 
tagaaaagca agagaatttg agaagatgaa tcagtcacta cgattatttc gggaagtttg 4080 
cctggctagt gaaaggtcaa ctgtagatgc tgaagctgtc cataaattgc tgactttgat 4140 
taaggaatta tagactgact acattggaag ctttgagttg acttctgaca aaggtggtaa 4200 
attcagacaa cattatgatc taataaactt tattttttaa aaat 4244 

<210> 28 
<211> 1128 
<212> PRT 

<213> Homo sapiens 
<400> 28 
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Met Ser Arg Arg Lys Pro Ala Ser Gly Gly Leu Ala Ala Ser Ser Ser 
1 5 io 15 

Ala Pro Ala Arg Gin Ala Val Leu Ser Arg Phe Phe Gin Ser Thr Gly 
20 25 30 

Ser Leu Lys Ser Thr Ser Ser Ser Thr Gly Ala Ala Asp Gin Val Asp 
35 40 15 

Pro Gly Ala Ala Ala Ala Ala Ala Pro Pro Ala Pro Ala Phe Pro Pro 
50 55 60 

Gin Leu Pro Pro His Val Ala Thr Glu lie Asp Arg Arg Lys Lys Arg 
65 70 75 so 

Pro Leu Glu Asn Asp Gly Pro Val Lys Lys Lys Val Lys Lys Val Gin 
85 90 95 

Gin Lys Glu Gly Gly Ser Asp Leu Gly Met Ser Gly Asn Ser Glu Pro 
100 105 no 

Lys Lys Cys Leu Arg Thr Arg Asn Val Ser Lys Ser Leu Glu Lys Leu 
115 120 125 

Lys Glu Phe Cys Cys Asp Ser Ala Leu Pro Gin Ser Arg Val Gin Thr 
130 135 140 

Glu Ser Leu Gin Glu Arg Phe Ala Val Leu Pro Lys Cys Thr Asp Phe 
145 150 155 leo 

Asp Asp lie Ser Leu Leu His Ala Lys Asn Ala Val Ser Ser Glu Asp 
165 ivo 175 

Ser Lys Arg Gin He Asn Gin Lys Asp Thr Thr Leu Phe Asp Leu Ser 
180 185 190 

Gin Phe Gly Ser Ser Asn Thr Ser His Glu Asn Leu Gin Lys Thr Ala 
195 200 205 

Ser Lys Ser Ala Asn Lys Arg Ser Lys Ser He Tyr Thr Pro Leu Glu 
210 215 220 

Leu Gin Tyr He Glu Met Lys Gin Gin His Lys Asp Ala Val Leu Cys 
225 230 235 240 

Val Glu Cys Gly Tyr Lys Tyr Arg Phe Phe Gly Glu Asp Ala Glu lie 
245 250 255 
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Ala Ala Arg Glu Leu Asn lie Tyr Cys His Leu Asp His Asn Phe Met 
260 265 270 

Thr Ala Ser lie Pro Thr His Arg Leu Phe Val His Val Arg Arg Leu 
275 280 285 

Val Ala Lys Gly Tyr Lys Val Gly Val Val Lys Gin Thr Glu Thr Ala 
290 295 300 

Ala Leu Lys Ala lie Gly Asp Asn Arg Ser Ser Leu Phe Ser Arg Lys 
305 310 315 320 

Leu Thr Ala Leu Tyr Thr Lys Ser Thr Leu lie Gly Glu Asp Val Asn 
325 330 335 

Pro Leu lie Lys Leu Asp Asp Ala Val Asn Val Asp Glu lie Met Thr 
340 345 350 

Asp Thr Ser Thr Ser Tyr Leu Leu Cys lie Ser Glu Asn Lys Glu Asn 
355 360 365 

Val Arg Asp Lys Lys Lys Gly Asn lie Phe lie Gly lie Val Gly Val 
370 375 380 

Gin Pro Ala Thr Gly Glu Val Val Phe Asp Ser Phe Gin Asp Ser Ala 
385 390 395 400 

Ser Arg Ser Glu Leu Glu Thr Arg Met Ser Ser Leu Gin Pro Val Glu 
405 410 415 

Leu Leu Leu Pro Ser Ala Leu Ser Glu Gin Thr Glu Ala Leu lie His 
420 425 430 

Arg Ala Thr Ser Val Ser Val Gin Asp Asp Arg lie Arg Val Glu Arg 
435 440 445 

Met Asp Asn lie Tyr Phe Glu Tyr Ser His Ala Phe Gin Ala Val Thr 
450 455 460 

Glu Phe Tyr Ala Lys Asp Thr Val Asp lie Lys Gly Ser Gin lie lie 
465 470 475 480 

Ser Gly lie Val Asn Leu Glu Lys Pro Val lie Cys Ser Leu Ala Ala 
485 490 495 

lie lie Lys Tyr Leu Lys Glu Phe Asn Leu Glu Lys Met Leu Ser Lys 
500 505 510 
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Pro Glu Asn Phe Lys Gin Leu Ser Ser Lys Met Glu Phe Met Thr lie 
515 520 525 

Asn Gly Thr Thr Leu Arg Asn Leu Glu lie Leu Gin Asn Gin Thr Asp 
530 535 540 

Met Lys Thr Lys Gly Ser Leu Leu Trp Val Leu Asp His Thr Lys Thr 
545 550 555 560 

Ser Phe Gly Arg Arg Lys Leu Lys Lys Trp Val Thr Gin Pro Leu Leu 
565 570 575 

Lys Leu Arg Glu He Asn Ala Arg Leu Asp Ala Val Ser Glu Val Leu 
580 585 590 

His Ser Glu Ser Ser Val Phe Gly Gin He Glu Asn His Leu Arg Lys 
595 600 605 

Leu Pro Asp He Glu Arg Gly Leu Cys Ser He Tyr His Lys Lys Cys 
610 615 620 

Ser Thr Gin Glu Phe Phe Leu He Val Lys Thr Leu Tyr His Leu Lys 
625 630 635 640 

Ser Glu Phe Gin Ala He He Pro Ala Val Asn Ser His He Gin Ser 
645 650 655 

Asp Leu Leu Arg Thr Val He Leu Glu He Pro Glu Leu Leu Ser Pro 
660 665 670 

Val Glu His Tyr Leu Lys He Leu Asn Glu Gin Ala Ala Lys Val Gly 
675 680 685 

Asp Lys Thr Glu Leu Phe Lys Asp Leu Ser Asp Phe Pro Leu He Lys 
690 695 700 

Lys Arg Lys Asp Glu He Gin Gly Val He Asp Glu He Arg Met His 
705 7 10 715 7 20 

Leu Gin Glu He Arg Lys He Leu Lys Asn Pro Ser Ala Gin Tyr Val 
7 25 730 735 

Thr Val Ser Gly Gin Glu Phe Met He Glu He Lys Asn Ser Ala Val 
7 40 745 750 

Ser Cys He Pro Thr Asp Trp Val Lys Val Gly Ser Thr Lys Ala Val 
755 760 765 
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Ser Arg Phe His Ser Pro Phe lie Val Glu Asn Tyr Arg His Leu Asn 
770 775 780 

Gin Leu Arg Glu Gin Leu Val Leu Asp Cys Ser Ala Glu Trp Leu Asp 
785 790 795 800 

Phe Leu Glu Lys Phe Ser Glu His Tyr His Ser Leu Cys Lys Ala Val 
805 810 815 

His His Leu Ala Thr Val Asp Cys lie Phe Ser Leu Ala Lys Val Ala 
820 825 830 

Lys Gin Gly Asp Tyr Cys Arg Pro Thr Val Gin Glu Glu Arg Lys lie 
835 840 845 

Val lie Lys Asn Gly Arg His Pro Val lie Asp Val Leu Leu Gly Glu 
850 855 860 

Gin Asp Gin Tyr Val Pro Asn Asn Thr Asp Leu Ser Glu Asp Ser Glu 
865 870 875 880 

Arg Val Met lie lie Thr Gly Pro Asn Met Gly Gly Lys Ser Ser Tyr 
885 890 895 

lie Lys Gin Val Ala Leu lie Thr lie Met Ala Gin lie Gly Ser Tyr 
900 905 910 

Val Pro Ala Glu Glu Ala Thr lie Gly lie Val Asp Gly He Phe Thr 
915 920 925 

Arg Met Gly Ala Ala Asp Asn He Tyr Lys Gly Arg Ser Thr Phe Met 
930 935 940 

Glu Glu Leu Thr Asp Thr Ala Glu He He Arg Lys Ala Thr Ser Gin 
945 950 955 960 

Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr His Asp 
965 970 975 

Gly He Ala He Ala Tyr Ala Thr Leu Glu Tyr Phe He Arg Asp Val 
980 985 990 

Lys Ser Leu Thr Leu Phe Val Thr His Tyr Pro Pro Val Cys Glu Leu 
995 1000 1005 

Glu Lys Asn Tyr Ser His Gin Val Gly Asn Tyr His Met Gly Phe Leu 
1010 1015 1020 
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Val Ser Glu Asp Glu Ser Lys Leu Asp Pro Gly Ala Ala Glu Gin Val 
1025 1030 1035 1040 

Pro Asp Phe Val Thr Phe Leu Tyr Gin He Thr Arg Gly He Ala Ala 
!045 1050 1055 

Arg Ser Tyr Gly Leu Asn Val Ala Lys Leu Ala Asp Val Pro Gly Glu 
1060 1065 1070 

lie Leu Lys Lys Ala Ala His Lys Ser Lys Glu Leu Glu Gly Leu He 
1075 1080 1085 

Asn Thr Lys Arg Lys Arg Leu Lys Tyr Phe Ala Lys Leu Trp Thr Met 
1090 1095 HOO 

His Asn Ala Gin Asp Leu Gin Lys Trp Thr Glu Glu Phe Asn Met Glu 
1105 1110 1H5 H20 

Glu Thr Gin Thr Ser Leu Leu His 
1125 



<210> 29 

<211> 4374 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gggcacgagc cctgccatgt ctcgccggaa 
ctcagcccct gcgaggcaag cggttttgag 
atccacctcc tcctccacag gtgcagccga 
agcgccccca gcgcccgcct tcccgcccca 
cagaagaaag aagagaccat tggaaaatga 
ccaacaaaag gaaggaggaa gtgatctggg 
tctgaggacc aggaatgttt caaagtctct 
tgcccttcct caaagtagag tccagacaga 
aaaatgtact gattttgatg atatcagtct 
agattcgaaa cgtcaaatta atcaaaagga 
atcatcaaat acaagtcatg aaaatttaca 
gtccaaaagc atctatacgc cgctagaatt 
agatgcagtt ttgtgtgtgg aatgtggata 
gattgcagcc cgagagctca atatttattg 
tatacctact cacagactgt ttgttcatgt 
gggagttgtg aagcaaactg aaactgcagc 
actcttttcc cggaaattga ctgcccttta 
gaatccccta atcaagctgg atgatgctgt 
taccagctat cttctgtgca tctctgaaaa 
caacattttt attggcattg tgggagtgca 
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gcctgcgtcg ggcggcctcg ctgcctccag 60 

ccgattcttc cagtctacgg gaagcctgaa 120 

ccaggtggac cctggcgctg cagcggccgc 180 

gctgccgccg cacgtagcta cagaaattga 24 0 

tgggcctgtt aaaaagaaag taaagaaagt 300 

aatgtctggc aactctgagc caaagaaatg 360 

ggaaaaattg aaagaattct gctgcgattc 420 

atctctgcag gagagatttg cagttctgcc 480 

tctacacgca aagaatgcag tttcttctga 540 

cacaacactt tttgatctca gtcagtttgg 600 

gaaaactgct tccaaatcag ctaacaaacg 660 

acaatacata gaaatgaagc agcagcacaa 720 

taagtataga ttctttgggg aagatgcaga 780 

ccatttagat cacaacttta tgacagcaag 840 

acgccgcctg gtggcaaaag gatataaggt 900 

attaaaggcc attggagaca acagaagttc 960 

tacaaaatct acacttattg gagaagatgt 1020 

aaatgttgat gagataatga ctgatacttc 1080 

taaggaaaat gttagggaca aaaaaaaggg 1140 

gcctgccaca ggcgaggttg tgtttgatag 1200 
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tttccaggac tctgcttctc gttcagagct 
agagctgctg cttccttcgg ccttgtccga 
atctgttagt gtgcaggatg acagaattcg 
atacagccat gctttccagg cagttacaga 
aggttctcaa attatttctg gcattgttaa 
tgccatcata aaatacctca aagaattcaa 
ttttaaacag ctatcaagta aaatggaatt 
tctggaaatc ctacagaatc agactgatat 
agaccacact aaaacttcat ttgggagacg 
ccttaaatta agggaaataa atgcccggct 
atctagtgtg tttggtcaga tagaaaatca 
actctgtagc atttatcaca aaaaatgttc 
tttatatcac ctaaagtcag aatttcaagc 
gtcagacttg ctccggaccg ttattttaga 
ttacttaaag atactcaatg aacaagctgc 
agacctttct gacttccctt taataaaaaa 
cgagatccga atgcatttgc aagaaatacg 
tgtgacagta tcaggacagg agtttatgat 
accaactgat tgggtaaagg ttggaagcac 
tattgtagaa aattacagac atctgaatca 
tgctgaatgg cttgattttc tagagaaatt 
agtgcatcac ctagcaactg ttgactgcat 
agattactgc agaccaactg tacaagaaga 
ccctgtgatt gatgtgttgc tgggagaaca 
atcagaggac tcagagagag taatgataat 
ctacataaaa caagttgcat tgattaccat 
agaagaagcg acaattggga ttgtggatgg 
tatatataaa ggacggagta catttatgga 
aaaagcaaca tcacagtcct tggttatctt 
tgatggaatt gccattgcct atgctacact 
aaccctgttt gtcacccatt atccgccagt 
ggtggggaat taccacatgg gattcttggt 
cgcagcagaa caagtccctg att.ttgt.cac 
agcaaggagt tatggattaa atgtggctaa 
gaaagcagct cacaagtcaa aagagctgga 
caagtatttt gcaaagttat ggacgatgca 
ggagttcaac atggaagaaa cacagacttc 
tgaacaaaaa atggagaatt aaaaatacca 
tatctttgtg tgacatgtga gcataaaatt 
agaggttttt ctgaagacag tctttttcaa 
aacactcttg aatagacttc cactttgtaa 
aaagccttaa gtggcagaat ataattccca 
tgatattttt atttgtttca gttcagataa 
atccattgaa ctaaaataat tttattatgc 
tttttataag tagaaagaat tggccaggca 
tgggaggcca aggtaggcag atcacctgag 
tggcaaaacc ccatctttac taaaaatata 
ttagctgggc atggtggcgc acacctgtag 

42 



agaaacccgg atgtcaagcc tgcagccagt 1260 
gcaaacagag gcgctcatcc acagagccac 1320 
agtcgaaagg atggataaca tttattttga 13SC 
gttttatgca aaagatacag ttgacatcaa 14 4 0 
cttagagaag cctgtgattt gctctttggc 1500 
cttggaaaag atgctctcca aacctgagaa 1560 
tatgacaatt aatggaacaa cattaaggaa 1620 
gaaaaccaaa ggaagtttgc tgtgggtttt 1680 
gaagttaaag aagtgggtga cccagccact 1740 
tgatgctgta tcggaagttc tccattcaga 1800 
tctacgtaaa ttgcccgaca tagagagggg 1860 
tacccaagag ttcttcttga ttgtcaaaac 1920 
aataatacct gctgttaatt cccacattca 1980 
aattcctgaa ctcctcagtc cagtggagca 204 0 
caaagttggg gataaaactg aattatttaa 2100 
gaggaaggat gaaattcaag gtgttattga 2160 
aaaaatacta aaaaatcctt ctgcacaata 2220 
agaaataaag aactctgctg tatcttgtat 2280 
aaaagctgtg agccgctttc actctccttt 2340 
gctccgggag cagctagtcc ttgactgcag 2400 
cagtgaacat tatcactcct tgtgtaaagc 24 60 
tttctccctg gccaaggtcg ctaagcaagg 2520 
aagaaaaatt gtaataaaaa atggaaggca 2580 
ggatcaatat gtcccaaata atacagattt 2640 
taccggacca aacatgggtg gaaagagctc 2700 
catggctcag attggctcct atgttcctgc 2760 
cattttcaca aggatgggtg ctgcagacaa 2820 
agaactgact gacacagcag aaataatcag 2880 
ggatgaacta ggaagaggga cgagcactca 2940 
tgagtatttc atcagagatg tgaaatcctt 3000 
ttgtgaacta gaaaaaaatt actcacacca 3060 
cagtgaggat gaaagcaaac tggatccagg 3120 
cttcctttac caaataacta gaggaattgc 3180 
actagcagat gttcctggag aaattttgaa 3240 
aggattaata aatacgaaaa gaaagagact 3300 
taatgcacaa gacctgcaga agtggacaga 3360 
tcttcttcat taaaatgaag actacatttg 3420 
actgtacaaa ataactctcc agtaacagcc 3480 
atgaccatgg tatattccta ttggaaacag 3540 
gtttctgtct tcctaacttt tctacgtata 3600 
ttagaaaatt ttatggacag taagtccagt 3660 
agcttttgga gggtgatata aaaatttact 3720 
ttggcaactg ggtgaatctg gcaggaatct 3780 
aaccagttta tccaccaaga acataagaat 3840 
tggtggctca tgcctgtaat cccagcactt 3900 
gtcaggagtt caagaccagc ctggccaaca 3960 
aagtacatct ctactaaaaa tacgaaaaaa 4020 
tcccagctac tccggaggct gaggcaggag 4080 
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aatctcttga acctgggagg cggaggttgc 
gcttgggcaa cagagcaaga ctccatctca 
caagctttta aaaactagag cacagaagga 
ttgtcatagg attaagcagt ttaaagattg 
taataaatat ttaatgaata cttgctataa 



aatgagccga gatcacgtca ctgcactcca 4140 
aaaaagaaaa aagaaaagaa atagaattat 4200 
ataaggtcat gaaatttaaa aggttaaata 4260 
ttggatgaaa ttatttgtca ttcattcaag 4320 
aaaaaaaaaa aaaaaaaaaa aaaa 4374 



<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 30 

gatatctcca ctgacgtaag 20 



<210> 31 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 31 

tgttgccggt cttgcgatg 



<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 32 

cccgatctag taacatagat g 



<210> 33 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 33 

cagtctggat cgcgaaaact g 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 34 

ggtgattacc gacgaaaacg 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 35 

agtgaagggc gaacagttcc 

<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 36 

gagtattgcc aacgaacc 
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<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
primer 

<400> 37 

gtatcaccgc gtctttgatc 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
primer 

<400> 38 

cgaaacgcag cacgatacg 

<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 
primer 

<400> 39 

gttcaacgct gacatcacc 

<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

45 



BNSDOCID: <WO 020548 56A1J_> 



Sequence : oligonucleotide 



20 



Sequence : oligonucleotide 
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Sequence : oligonucleotide 
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primer 
<400> 40 

catgttcatc tgcccagtcg 



<210> 41 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 41 

gctttggaca taccatcc 18 



<210> 42 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 42 

caccgaagtt catgccag 18 



<210> 43 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 43 

tgactacttt tgacttcagc c 21 



<210> 44 
<211> 22 
<212> DNA 
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<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 44 

aaccattcaa catttttaac cc 
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